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METHODS FOR IDENTIFYING RISK OF OSTEOARTHRITIS AND 
TREATMENTS THEREOF 

Field of the Invention 

[0001] The invention relates to genetic methods for identifying risk of osteoarthritis and treatments 
that specifically target such diseases. 

Background 

[0002] Osteoarthritis (OA) is a chronic disease usually affecting weight-bearing synovial joints. 
There are approximately 20 million Americans affected by OA and it is the leading cause of disability in 
the United States. In addition to extensive human suffering, OA also accounts for nearly all knee 
replacements and more than half of all hip replacements in the United States. Despite its prevalence, OA 
is poorly understood and there are few treatments available besides anti-inflammatory drugs and joint 
replacement. 

[0003] Most commonly affecting middle-aged and older people, OA can range from very mild to 
very severe. It affects hands and weight-bearing joints such as knees, hips, feet and the back. Knee OA 
can be as disabling as any cardiovascular disease except stroke. 

[0004] OA is characterized by the breakdown of cartilage in joints. Cartilage in joints cushions 
the ends of bones, and cartilage breakdown causes bones to rub against each other, causing pain and loss 
of movement. Type II collagen is the main component of cartilage, comprising 15-25% of the wet 
weight, approximately half the dry weight, and representing 90-95% of the total collagen content in the 
tissue. It forms fibrils that endow cartilage with tensile strength (Mayne, R. Arthritis Rhuem. 32:241-246 
(1989)). 

Summary 

[0005] It has been discovered that certain polymorphic variations in human genomic DNA are 
associated with osteoarthritis. In particular, polymorphic variants in a locus containing a APOL3 region 
in human genomic DNA have been associated with risk of osteoarthritis. 

[0006] Thus, featured herein are methods for identifying a subject at risk of osteoarthritis and/or a 
risk of osteoarthritis in a subject, which comprise detecting the presence or absence of one or more 
polymorphic variations associated with osteoarthritis in or around the loci described herein in a human 
nucleic acid sample. In an embodiment, two or more polymorphic variations are detected in two or more 
regions of which one is the APOL3 region. In certain embodiments, 3 or more, or 4, 5, 6, 7, 8, 9, 10, 11, 
12, 13, 14, 15, 16, 17, 18, 1 9 or 20 or more polymorphic variants are detected. 
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[0007] Also featured are nucleic acids that include one or more polymorphic variations associated 
with occurrence of osteoarthritis, as well as polypeptides encoded by these nucleic acids. In addition, 
provided are methods for identifying candidate therapeutic molecules for treating osteoarthritis, as well 
as methods for treating osteoarthritis in a subject by identifying a subject at risk of osteoarthritis and 
treating the subject with a suitable prophylactic, treatment or therapeutic molecule. 

[0008] Also provided are compositions comprising a cell from a subject having osteoarthritis or at 
risk of osteoarthritis and/or a APOL3 nucleic acid, with a RNAi, siRNA, antisense DNA or RNA, or 
ribozyme nucleic acid designed from a APOL3 nucleotide sequence. In an embodiment, the RNAi, 
siRNA, antisense DNA or RNA, or ribozyme nucleic acid is designed from a APOL3 nucleotide 
sequence that includes one or more polymorphic variations associated with osteoarthritis, and in some 
instances, specifically interacts with such a nucleotide sequence. Further, provided are arrays of nucleic 
acids bound to a solid surface, in which one or more nucleic acid molecules of the array have a APOL3 
nucleotide sequence, or a fragment or substantially identical nucleic acid thereof, or a complementary 
nucleic acid of the foregoing. Featured also are compositions comprising a cell from a subject having 
osteoarthritis or at risk of osteoarthritis and/or a APOL3 polypeptide, with an antibody that specifically 
binds to the polypeptide. In an embodiment, the antibody specifically binds to an epitope in the 
polypeptide that includes a non-synonymous amino acid modification associated with osteoarthritis (e.g., 
results in an amino acid substitution in the encoded polypeptide associated with osteoarthritis). 

Brief Description of the Drawings 

[0009] Figure 1 shows proximal SNPs in a APOL3 region in genomic DNA. The position of each 
SNP in the chromosome is shown on the x-axis and the y-axis provides the negative logarithm of the p- 
value comparing the estimated allele to that of the control group. Also shown in the figure are exons and 
introns of the gene region in the approximate chromosomal positions. 

Detailed Description 

[0010] It has been discovered that a polymorphic variant in a locus containing a APOL3 region is 
associated with occurrence of osteoarthritis in subjects. Thus, detecting genetic determinants associated 
with an increased risk of osteoarthritis occurrence can lead to early identification of a predisposition to 
osteoarthritis and early prescription of preventative measures. Also, associating a APOL3 polymorphic 
variant with osteoarthritis has provided new targets for screening molecules useful in treatments of 
osteoarthritis. 
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Osteoarthritis and Sample Selection 

[0011] Osteoarthritis (OA), or degenerative joint disease, is one of the oldest and most common 
types of arthritis. It is characterized by the breakdown of the joint's cartilage. Cartilage is the part of the 
joint that cushions the ends of bones, and its breakdown causes bones to rub against each other, causing 
pain and loss of movement. Type II collagen is the main component of cartilage, comprising 15-25% of 
the wet weight, approximately half the dry weight, and representing 90-95% of the total collagen content 
in the tissue. It forms fibrils that endow cartilage with tensile strength (Mayne, R. Arthritis Rhuem. 
32:241-246(1989)). 

[0012] Most commonly affecting middle-aged and older people, OA can range from very mild to 
very severe. It affects hands and weight-bearing joints such as knees, hips, feet and the back. Knee OA 
can be as disabling as any cardiovascular disease except stroke. 

[0013] Osteoarthritis affects an estimated 20.7 million Americans, mostly after age 45, with women 
more commonly affected than men. Physicians make a diagnosis of OA based on a physical exam and 
history of symptoms. X-rays are used to confirm diagnosis. Most people over 60 reflect the disease on 
X-ray, and about one-third have actual symptoms. 

[0014] There are many factors that can cause OA. Obesity may lead to osteoarthritis of the knees. 
In addition, people with joint injuries due to sports, work-related activity or accidents may be at increased 
risk of developing OA. 

[0015] Genetics has a role in the development of OA. Some people may be born with defective 
cartilage or with slight defects in the way that joints fit together. As a person ages, these defects may 
cause early cartilage breakdown in the joint or the inability to repair damaged or deteriorated cartilage in 
the joint. 

[0016] Inclusion or exclusion of samples for an osteoarthritis pool may be based upon the following 
criteria: ethnicity (e.g., samples derived from an individual characterized as Caucasian); parental 
ethnicity (e.g., samples derived from an individual of British paternal and maternal descent); relevant 
phenotype information for the individual (e.g., case samples derived from individuals diagnosed with 
specific knee osteoarthritis (OA) and were recruited from an OA knee replacement clinic). Control 
samples may be selected based on relevant phenotype information for the individual (e.g., derived from 
individuals free of OA at several sites (knee, hand, hip etc)); and no family history of OA and/or 
rheumatoid arthritis. Additional phenotype information collected for both cases and controls may include 
age of the individual, gender, family history of OA, diagnosis with osteoarthritis (joint location of OA, 
date of primary diagnosis, age of individual as of primary diagnosis), knee history (current symptoms, 
any major knee injury, menisectomy, knee replacement surgery, age of surgery), HRT history, 
osteoporosis diagnosis. 
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[0017] Based in part upon selection criteria set forth above, individuals having osteoarthritis can be 
selected for genetic studies. Also, individuals having no history of osteoarthritis often are selected for 
genetic studies, as described hereafter. 

Polymorphic Variants Associated with Osteoarthritis 

[0018] A genetic analysis provided herein linked osteoarthritis with polymorphic variant nucleic 
acid sequences in the human genome. As used herein, the term "polymorphic site" refers to a region in a 
nucleic acid at which two or more alternative nucleotide sequences are observed in a significant number 
of nucleic acid samples from a population of individuals. A polymorphic site may be a nucleotide 
sequence of two or more nucleotides, an inserted nucleotide or nucleotide sequence, a deleted nucleotide 
or nucleotide sequence, or a microsatellite, for example. A polymorphic site that is two or more 
nucleotides in length may be 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15 or more, 20 or more, 30 or more, 50 
or more, 75 or more, 100 or more, 500 or more, or about 1000 nucleotides in length, where all or some of 
the nucleotide sequences differ within the region. A polymorphic site is often one nucleotide in length, 
which is referred to herein as a "single nucleotide polymorphism" or a "SNP." 

[0019] Where there are two, three, or four alternative nucleotide sequences at a polymorphic site, 
each nucleotide sequence is referred to as a "polymorphic variant" or "nucleic acid variant." Where two 
polymorphic variants exist, for example, the polymorphic variant represented in a minority of samples 
from a population is sometimes referred to as a "minor allele" and the polymorphic variant that is more 
prevalently represented is sometimes referred to as a "major allele." Many organisms possess a copy of 
each chromosome (e.g., humans), and those individuals who possess two major alleles or two minor 
alleles are often referred to as being "homozygous" with respect to the polymorphism, and those 
individuals who possess one major allele and one minor allele are normally referred to as being 
"heterozygous" with respect to the polymorphism. Individuals who are homozygous with respect to one 
allele are sometimes predisposed to a different phenotype as compared to individuals who are 
heterozygous or homozygous with respect to another allele. 

[0020] In genetic analysis that associate polymorphic variants with osteoarthritis, samples from 
individuals having osteoarthritis and individuals not having osteoarthritis often are allelotyped and/or 
genotyped. The term "allelotype" as used herein refers to a process for determining the allele frequency 
for a polymorphic variant in pooled DNA samples from cases and controls. By pooling DNA from each 
group, an allele frequency for each SNP in each group is calculated. These allele frequencies are then 
compared to one another. The term "genotyped" as used herein refers to a process for determining a 
genotype of one or more individuals, where a "genotype" is a representation of one or more polymorphic 
variants in a population. 
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[0021] A genotype or polymorphic variant may be expressed in terms of a "haplotype," which as 
used herein refers to two or more polymorphic variants occurring within genomic DNA in a group of 
individuals within a population. For example, two SNPs may exist within a gene where each SNP 
position includes a cytosine variation and an adenine variation. Certain individuals in a population may 
carry one allele (heterozygous) or two alleles (homozygous) having the gene with a cytosine at each SNP 
position. As the two cytosines corresponding to each SNP in the gene travel together on one or both 
alleles in these individuals, the individuals can be characterized as having a cytosine/cytosine haplotype 
with respect to the two SNPs in the gene. 

[0022] As used herein, the term "phenotype" refers to a trait which can be compared between 
individuals, such as presence or absence of a condition, a visually observable difference in appearance 
between individuals, metabolic variations, physiological variations, variations in the function of 
biological molecules, and the like. An example of a phenotype is occurrence of osteoarthritis. 

[0023] Researchers sometimes report a polymorphic variant in a database without determining 
whether the variant is represented in a significant fraction of a population. Because a subset of these 
reported polymorphic variants are not represented in a statistically significant portion of the population, 
some of them are sequencing errors and/or not biologically relevant. Thus, it is often not known whether 
a reported polymorphic variant is statistically significant or biologically relevant until the presence of the 
variant is detected in a population of individuals and the frequency of the variant is determined. Methods 
for detecting a polymorphic variant in a population are described herein, specifically in Example 2. A 
polymorphic variant is statistically significant and often biologically relevant if it is represented in 5% or 
more of a population, sometimes 10% or more, 15% or more, or 20% or more of a population, and often 
25% or more, 30% or more, 35% or more, 40% or more, 45% or more, or 50% or more of a population. 

[0024] A polymorphic variant may be detected on either or both strands of a double-stranded nucleic 
acid. Also, a polymorphic variant may be located within an intron or exon of a gene or within a portion 
of a regulatory region such as a promoter, a 5' untranslated region (UTR), a 3' UTR, and in DNA (e.g., 
genomic DNA (gDNA) and complementary DNA (cDNA)), RNA (e.g., mRNA, tRNA, and rRNA), or a 
polypeptide. Polymorphic variations may or may not result in detectable differences in gene expression, 
polypeptide structure, or polypeptide function. 

[0025] It was determined that polymorphic variations associated with an increased risk of 
osteoarthritis existed in a APOL3 region in SEQ ID NO: 1 . In certain embodiments, a polymorphic 
variant at position rs 132659 in the human genome was associated with an increased risk of osteoarthritis, 
and in a specific embodiment, a cytosine at position rsl32659 was associated with an increased risk of 
osteoarthritis. 

[0026] Polymorphic variants in and around the APOL3 locus were tested for association with 
osteoarthritis. These include polymorphic variants at positions in SEQ ID NO: 1 selected from the group 
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consisting of 201, 425, 1095, 2201, 7879, 8395, 8461, 9503, 10304, 10695, 16300, 16444, 17591, 17988, 
19116, 19358, 20300, 20669, 20891, 21451, 21978, 22785, 24248, 24770, 24844, 25066, 25096, 25309, 
25344, 25529, 25537, 25554, 27963, 28134, 28356, 29648, 29986, 30217, 30267, 30315, 30585, 30724, 
30897, 30931, 31080, 31246, 31373, 31463, 31467, 32188, 32288, 32520, 32594, 32657, 32677, 32764, 
32784, 32830, 32872, 33121, 33348, 33952, 34184, 34361, 35026, 35192, 35600, 36033, 36289, 38869, 
39629, 40530, 41621, 42379, 42802, 42865, 43644, 45051 , 45828, 45829, 46257, 47286, 47427, 47963, 
48013, 48229, 48282, 48376, 48404, 49900, 52699, 52897, 53414, 53487, 541 12, 55492, 59766, 60307, 
60701, 60952, 61401, 62379, 62870, 62879, 63499, 64284, 64408, 64760, 65230, 66127, , 6634, 66686, 
66694, 671 13, 67257, 67403, 67609, 68418, 68610, 69629, 70024, 70848, 71428, 71553, 71633, 71768 
,71769, 73039, 73325, 73412, 73547, 73769, 73806, 74467, 74472, 74473, 74482, 74494, 74592, 74670, 
74672, 74714, 74723, 74749, 74861, 74892, 74893, 75176, 75705, 75989, 76027, 77949, 77974, 78167, 
78310, 78415, 78575, 78590, 78709, 78875, 79864, 81316, 81320, 81409, 81737, 81843, 82102, 82833, 
83461, 83624, 83660, 83701, 83708, 83782, 85707, 85717, 86486, 86833, 87115, 87234, 87479, 87561, 
87604, 87674, 87958, 87992, 88019, 88074, 88079, 88115, 88118, 88120, 88135, 88142 ,88143, 88149, 
88340, 88344, 88512, 88521, 88650, 88827, 89230, 89236, 90754, 90984, 91 1 10, 92026, 92954, 93375, 
93794, 94937, 95068, 96188, 97092 and 98812. Polymorphic variants at the following positions in SEQ 
ID NO: 1 in particular were associated with an increased risk of osteoarthritis: 20300, 46257, 87958, 
89236, 30267, 32657, 36289, 38869, 45051, 54112, 60307, 63499, 20891, 52699, 71768, with specific 
embodiments directed to position 46257. In particular, the following polymorphic variants in SEQ ID 
NO: 1 were associated with risk of osteoarthritis: an adenine at position 20300, a thymine at position 
46257, an adenine at position 89236, a guanine at position 30267, an adenine at position 32657, a 
cytosine at position 36289, a guanine at position 38869, a thymine at position 4505 1, a guanine at 
position 541 12, an adenine at position 60307, a thymine at position 63499, a guanine at position 20891, a 
guanine at position 52699, and a cytosine at position 71768. 

[0027] Based in part upon analyses summarized in Figure 1 , a region with significant association 
has been identified in a locus associated with osteoarthritis. Any polymorphic variants associated with 
osteoarthritis in a region of significant association can be utilized for embodiments described herein. For 
example, polymorphic variants in a region approximately 5000 nucleotides in length and spanning 
chromosome positions 34828750 and 34833750 in a APOL3 locus have significant association 
(chromosome positions are within NCBI's Genome build 34) 

Additional Polymorphic Variants Associated with Osteoarthritis 

[0028] Also provided is a method for identifying polymorphic variants proximal to an incident, 
founder polymorphic variant associated with osteoarthritis. Thus, featured herein are methods for 
identifying a polymorphic variation associated with osteoarthritis that is proximal to an incident 
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polymorphic variation associated with osteoarthritis, which comprises identifying a polymorphic variant 
proximal to the incident polymorphic variant associated with osteoarthritis, where the incident 
polymorphic variant is in a APOL3 nucleotide sequence. The nucleotide sequence often comprises a 
polynucleotide sequence selected from the group consisting of (a) a polynucleotide sequence of SEQ ID 
NO: 1-7; (b) a polynucleotide sequence that encodes a polypeptide having an amino acid sequence 
encoded by a polynucleotide sequence of SEQ ID NO: 1 -7; and (c) a polynucleotide sequence that 
encodes a polypeptide having an amino acid sequence that is 90% or more identical to an amino acid 
sequence encoded by a nucleotide sequence of SEQ ID NO: 1-7 or a polynucleotide sequence 90% or 
more identical to the polynucleotide sequence of SEQ ID NO: 1 -7. The presence or absence of an 
association of the proximal polymorphic variant with osteoarthritis then is determined using a known 
association method, such as a method described in the Examples hereafter. In an embodiment, the 
incident polymorphic variant is a polymorphic variant associated with osteoarthritis described herein. In 
another embodiment, the proximal polymorphic variant identified sometimes is a publicly disclosed 
polymorphic variant, which for example, sometimes is published in a publicly available database. In 
other embodiments, the polymorphic variant identified is not publicly disclosed and is discovered using a 
known method, including, but not limited to, sequencing a region surrounding the incident polymorphic 
variant in a group of nucleic samples. Thus, multiple polymorphic variants proximal to an incident 
polymorphic variant are associated with osteoarthritis using this method. 

[0029] The proximal polymorphic variant often is identified in a region surrounding the incident 
polymorphic variant. In certain embodiments, this surrounding region is about 50 kb flanking the first 
polymorphic variant {e.g. about 50 kb 5' of the first polymorphic variant and about 50 kb 3' of the first 
polymorphic variant), and the region sometimes is composed of shorter flanking sequences, such as 
flanking sequences of about 40 kb, about 30 kb, about 25 kb, about 20 kb, about 15 kb, about 10 kb, 
about 7 kb, about 5 kb, or about 2 kb 5' and 3' of the incident polymorphic variant. In other 
embodiments, the region is composed of longer flanking sequences, such as flanking sequences of about 
55 kb, about 60 kb, about 65 kb, about 70 kb, about 75 kb, about 80 kb, about 85 kb, about 90 kb, about 
95 kb, or about 1 00 kb 5 ' and 3 ' of the incident polymorphic variant. 

[0030] In certain embodiments, polymorphic variants associated with osteoarthritis are identified 
iteratively. For example, a first proximal polymorphic variant is associated with osteoarthritis using the 
methods described above and then another polymorphic variant proximal to the first proximal 
polymorphic variant is identified {e.g., publicly disclosed or discovered) and the presence or absence of 
an association of one or more other polymorphic variants proximal to the first proximal polymorphic 
variant with osteoarthritis is determined. 

[0031] The methods described herein are useful for identifying or discovering additional 
polymorphic variants that may be used to further characterize a gene, region or loci associated with a 
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condition, a disease (e.g., osteoarthritis), or a disorder. For example, allelotyping or genotyping data 
from the additional polymorphic variants may be used to identify a functional mutation or a region of 
linkage disequilibrium. In certain embodiments, polymorphic variants identified or discovered within a 
region comprising the first polymorphic variant associated with osteoarthritis are genotyped using the 
genetic methods and sample selection techniques described herein, and it can be determined whether 
those polymorphic variants are in linkage disequilibrium with the first polymorphic variant. The size of 
the region in linkage disequilibrium with the first polymorphic variant also can be assessed using these 
genotyping methods. Thus, provided herein are methods for determining whether a polymorphic variant 
is in linkage disequilibrium with a first polymorphic variant associated with osteoarthritis, and such 
information can be used in prognosis/diagnosis methods described herein. 

Isolated Nucleic Acids 

[0032] Featured herein are isolated APOL3 nucleic acid variants depicted in SEQ ID NO: 1 -7, and 
substantially identical nucleic acids thereof. A nucleic acid variant may be represented on one or both 
strands in a double-stranded nucleic acid or on one chromosomal complement (heterozygous) or both 
chromosomal complements (homozygous). 

[0033] As used herein, the term "nucleic acid" includes DNA molecules (e.g. , a complementary 
DNA (cDNA) and genomic DNA (gDNA)) and RNA molecules (e.g., mRNA, rRNA, siRNA and tRNA) 
and analogs of DNA or RNA, for example, by use of nucleotide analogs. The nucleic acid molecule can 
be single-stranded and it is often double-stranded. The term "isolated or purified nucleic acid" refers to 
nucleic acids that are separated from other nucleic acids present in the natural source of the nucleic acid. 
For example, with regard to genomic DNA, the term "isolated" includes nucleic acids which are 
separated from the chromosome with which the genomic DNA is naturally associated. An "isolated" 
nucleic acid is often free of sequences which naturally flank the nucleic acid (i.e., sequences located at 
the 5' and/or 3' ends of the nucleic acid) in the genomic DNA of the organism from which the nucleic 
acid is derived. For example, in various embodiments, the isolated nucleic acid molecule can contain less 
than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of 5' and/or 3' nucleotide sequences which flank 
the nucleic acid molecule in genomic DNA of the cell from which the nucleic acid is derived. Moreover, 
an "isolated" nucleic acid molecule, such as a cDNA molecule, can be substantially free of other cellular 
material, or culture medium when produced by recombinant techniques, or substantially free of chemical 
precursors or other chemicals when chemically synthesized. As used herein, the term "gene" refers to a 
nucleotide sequence that encodes a polypeptide. 

[0034] Also included herein are nucleic acid fragments. These fragments often have a nucleotide 
sequence identical to a nucleotide sequence of SEQ ED NO: 1 -7, a nucleotide sequence substantially 
identical to a nucleotide sequence of SEQ ID NO: 1-7, or a nucleotide sequence that is complementary to 
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the foregoing. The nucleic acid fragment may be identical, substantially identical or homologous to a 
nucleotide sequence in an exon or an intron in a nucleotide sequence of SEQ ID NO: 1-7, and may 
encode a domain or part of a domain of a polypeptide. Sometimes, the fragment will comprises one or 
more of the polymorphic variations described herein as being associated with osteoarthritis. The nucleic 
acid fragment is often 50, 100, or 200 or fewer base pairs in length, and is sometimes about 300, 400, 500, 
600, 700, 800, 900, 1000, 1100, 1200, 1300, 1400, 1500,2000, 3000, 4000, 5000, 10000, 15000, or 20000 
base pairs in length. A nucleic acid fragment that is complementary to a nucleotide sequence identical or 
substantially identical to a nucleotide sequence in SEQ ID NO: 1 -7 and hybridizes to such a nucleotide 
sequence under stringent conditions is often referred to as a "probe." Nucleic acid fragments often include 
one or more polymorphic sites, or sometimes have an end that is adjacent to a polymorphic site as described 
hereafter. 

[0035] An example of a nucleic acid fragment is an oligonucleotide. As used herein, the term 
"oligonucleotide" refers to a nucleic acid comprising about 8 to about 50 covalently linked nucleotides, 
often comprising from about 8 to about 35 nucleotides, and more often from about 1 0 to about 25 
nucleotides. The backbone and nucleotides within an oligonucleotide may be the same as those of 
naturally occurring nucleic acids, or analogs or derivatives of naturally occurring nucleic acids, provided 
that oligonucleotides having such analogs or derivatives retain the ability to hybridize specifically to a 
nucleic acid comprising a targeted polymorphism. Oligonucleotides described herein may be used as 
hybridization probes or as components of prognostic or diagnostic assays, for example, as described 
herein. 

[0036] Oligonucleotides are typically synthesized using standard methods and equipment, such as 
the ABI™3900 High Throughput DNA Synthesizer and the EXPEDITE™ 8909 Nucleic Acid 
Synthesizer, both of which are available from Applied Biosystems (Foster City, CA). Analogs and 
derivatives are exemplified in U.S. Pat. Nos. 4,469,863; 5,536,821; 5,541,306; 5,637,683; 5,637,684; 
5,700,922; 5,717,083; 5,719,262; 5,739,308; 5,773,601; 5,886,165; 5,929,226; 5,977,296; 6,140,482; 
WO 00/56746; WO 01/14398, and related publications. Methods for synthesizing oligonucleotides 
comprising such analogs or derivatives are disclosed, for example, in the patent publications cited above 
and in U.S. Pat. Nos. 5,614,622; 5,739,314; 5,955,599; 5,962,674; 6,1 17,992; in WO 00/75372; and in 
related publications. 

[0037] Oligonucleotides may also be linked to a second moiety. The second moiety may be an 
additional nucleotide sequence such as a tail sequence (e.g. , a polyadenosine tail), an adapter sequence 
{e.g. , phage M 1 3 universal tail sequence), and others. Alternatively, the second moiety may be a non- 
nucleotide moiety such as a moiety which facilitates linkage to a solid support or a label to facilitate 
detection of the oligonucleotide. Such labels include, without limitation, a radioactive label, a 
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fluorescent label, a chemiluminescent label, a paramagnetic label, and the like. The second moiety may 
be attached to any position of the oligonucleotide, provided the oligonucleotide can hybridize to the 
nucleic acid comprising the polymorphism. 

Uses for Nucleic Acid Sequence 

[0038] Nucleic acid coding sequences may be used for diagnostic purposes for detection and control 
of polypeptide expression. Also, included herein are oligonucleotide sequences such as antisense RNA, 
small-interfering RNA (siRNA) and DNA molecules and ribozymes that function to inhibit translation of 
a polypeptide. Antisense techniques and RNA interference techniques are known in the art and are 
described herein. 

[0039J Ribozymes are enzymatic RNA molecules capable of catalyzing the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence specific hybridization of the ribozyme 
molecule to complementary target RNA, followed by endonucleolytic cleavage. For example, 
hammerhead motif ribozyme molecules may be engineered that specifically and efficiently catalyze 
endonucleolytic cleavage of RNA sequences corresponding to or complementary to APOL3 nucleotide 
sequences. Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
scanning the target molecule for ribozyme cleavage sites which include the following sequences, GUA, 
GUU and GUC. Once identified, short RNA sequences of between fifteen (15) and twenty (20) 
ribonucleotides corresponding to the region of the target gene containing the cleavage site may be 
evaluated for predicted structural features such as secondary structure that may render the oligonucleotide 
sequence unsuitable. The suitability of candidate targets may also be evaluated by testing their 
accessibility to hybridization with complementary oligonucleotides, using ribonuclease protection assays. 

[0040] Antisense RNA and DNA molecules, siRNA and ribozymes may be prepared by any method 
known in the art for the synthesis of RNA molecules. These include techniques for chemically 
synthesizing oligodeoxyribonucleotides well known in the art such as solid phase phosphoramidite 
chemical synthesis. Alternatively, RNA molecules may be generated by in vitro and in vivo transcription 
of DNA sequences encoding the antisense RNA molecule. Such DNA sequences may be incorporated 
into a wide variety of vectors which incorporate suitable RNA polymerase promoters such as the T7 or 
SP6 polymerase promoters. Alternatively, antisense cDNA constructs that synthesize antisense RNA 
constitutively or inducibly, depending on the promoter used, can be introduced stably into cell lines. 

[0041] DNA encoding a polypeptide also may have a number of uses for the diagnosis of diseases, 
including osteoarthritis, resulting from aberrant expression of a target gene described herein. For 
example, the nucleic acid sequence may be used in hybridization assays of biopsies or autopsies to 
diagnose abnormalities of expression or function {e.g., Southern or Northern blot analysis, in situ 
hybridization assays). 
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[0042] In addition, the expression of a polypeptide during embryonic development may also be 
determined using nucleic acid encoding the polypeptide. As addressed, infra, production of functionally 
impaired polypeptide is the cause of various disease states, such as osteoarthritis. In situ hybridizations 
using polypeptide as a probe may be employed to predict problems related to osteoarthritis. Further, as 
indicated, infra, administration of human active polypeptide, recombinantly produced as described 
herein, may be used to treat disease states related to functionally impaired polypeptide. Alternatively, 
gene therapy approaches may be employed to remedy deficiencies of functional polypeptide or to replace 
or compete with dysfunctional polypeptide. 

Expression Vectors. Host Cells, and Genetically Engineered Cells 

[0043] Provided herein are nucleic acid vectors, often expression vectors, which contain a APOL3 
nucleotide sequence, or a substantially identical sequence thereof. As used herein, the term "vector" 
refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked 
and can include a plasmid, cosmid, or viral vector. The vector can be capable of autonomous replication 
or it can integrate into a host DNA. Viral vectors may include replication defective retroviruses, 
adenoviruses and adeno-associated viruses for example. 

[0044] A vector can include a APOL3 nucleotide sequence in a form suitable for expression of an 
encoded target polypeptide or target nucleic acid in a host cell. A "target polypeptide" is a polypeptide 
encoded by a APOL3 nucleotide sequence, or a substantially identical nucleotide sequence thereof. The 
recombinant expression vector typically includes one or more regulatory sequences operatively linked to 
the nucleic acid sequence to be expressed. The term "regulatory sequence" includes promoters, 
enhancers and other expression control elements {e.g., polyadenylation signals). Regulatory sequences 
include those that direct constitutive expression of a nucleotide sequence, as well as tissue-specific 
regulatory and/or inducible sequences. The design of the expression vector can depend on such factors as 
the choice of the host cell to be transformed, the level of expression of polypeptide desired, and the like. 
Expression vectors can be introduced into host cells to produce target polypeptides, including fusion 
polypeptides. 

[0045] Recombinant expression vectors can be designed for expression of target polypeptides in 
prokaryotic or eukaryotic cells. For example, target polypeptides can be expressed in E. coli, insect cells 
{e.g. , using baculovirus expression vectors), yeast cells, or mammalian cells. Suitable host cells are 
discussed further in Goeddel, Gene Expression Technology: Methods in Enzymology 185, Academic 
Press, San Diego, CA (1990). Alternatively, the recombinant expression vector can be transcribed and 
translated in vitro, for example using T7 promoter regulatory sequences and T7 polymerase. 

[0046] Expression of polypeptides in prokaryotes is most often carried out in E. coli with vectors 
containing constitutive or inducible promoters directing the expression of either fusion or non-fusion 

11 



PATENT 
Docket 524593009200 



polypeptides. Fusion vectors add a number of amino acids to a polypeptide encoded therein, usually to 
the amino terminus of the recombinant polypeptide. Such fusion vectors typically serve three purposes: 
1) to increase expression of recombinant polypeptide; 2) to increase the solubility of the recombinant 
polypeptide; and 3) to aid in the purification of the recombinant polypeptide by acting as a ligand in 
affinity purification. Often, a proteolytic cleavage site is introduced at the junction of the fusion moiety 
and the recombinant polypeptide to enable separation of the recombinant polypeptide from the fusion 
moiety subsequent to purification of the fusion polypeptide. Such enzymes, and their cognate 
recognition sequences, include Factor Xa, thrombin and enterokinase. Typical fusion expression vectors 
include pGEX (Pharmacia Biotech Inc; Smith & Johnson, Gene 67: 31-40 (1988)), pMAL (New England 
Biolabs, Beverly, MA) and pRIT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase 
(GST), maltose E binding polypeptide, or polypeptide A, respectively, to the target recombinant 
polypeptide. 

[0047] Purified fusion polypeptides can be used in screening assays and to generate antibodies 
specific for target polypeptides. In a therapeutic embodiment, fusion polypeptide expressed in a 
retroviral expression vector is used to infect bone marrow cells that are subsequently transplanted into 
irradiated recipients. The pathology of the subject recipient is then examined after sufficient time has 
passed (e.g. , six (6) weeks). 

[0048] Expressing the polypeptide in host bacteria with an impaired capacity to proteolytically 
cleave the recombinant polypeptide is often used to maximize recombinant polypeptide expression 
(Gottesman, S., Gene Expression Technology: Methods in Enzymology, Academic Press, San Diego, 
California 185: 1 19-128 (1990)). Another strategy is to alter the nucleotide sequence of the nucleic acid 
to be inserted into an expression vector so that the individual codons for each amino acid are those 
preferentially utilized in E. coli (Wada et al, Nucleic Acids Res. 20: 2 1 1 1 -2 1 1 8 ( 1 992)). Such alteration 
of nucleotide sequences can be carried out by standard DNA synthesis techniques. 

[0049] When used in mammalian cells, the expression vector's control functions are often provided 
by viral regulatory elements. For example, commonly used promoters are derived from polyoma, 
Adenovirus 2, cytomegalovirus and Simian Virus 40. Recombinant mammalian expression vectors are 
often capable of directing expression of the nucleic acid in a particular cell type (e.g. , tissue-specific 
regulatory elements are used to express the nucleic acid). Non-limiting examples of suitable tissue- 
specific promoters include an albumin promoter (liver-specific; Pinkert et al., Genes Dev. J: 268-277 
(1987)), lymphoid-specific promoters (Calame & Eaton, Adv. Immunol. 43: 235-275 (1988)), promoters 
of T cell receptors (Winoto & Baltimore, EMBOJ. 8: 729-733 (1989)) promoters of immunoglobulins 
(Banerji et al., Cell 33: 729-740 (1983); Queen & Baltimore, Cell 33: 741-748 (1983)), neuron-specific 
promoters (e.g. , the neurofilament promoter; Byrne & Ruddle, Proc. Natl. Acad. Sci. USA 86: 5473- 
5477 (1989)), pancreas-specific promoters (Edlund et al., Science 230: 912-916 (1985)), and mammary 
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gland-specific promoters (e.g., milk whey promoter; U.S. Patent No. 4,873,316 and European 
Application Publication No. 264,166). Developmentally-regulated promoters are sometimes utilized, for 
example, the murine hox promoters (Kessel & Gruss, Science 249: 374-379 (1990)) and the a- 
fetopolypeptide promoter (Campes & Tilghman, Genes Dev. 3: 537-546 (1989)). 

[0050] A APOL3 nucleic acid also may be cloned into an expression vector in an antisense 
orientation. Regulatory sequences (e.g., viral promoters and/or enhancers) operatively linked to a APOL3 
nucleic acid cloned in the antisense orientation can be chosen for directing constitutive, tissue specific or 
cell type specific expression of antisense RNA in a variety of cell types. Antisense expression vectors 
can be in the form of a recombinant plasmid, phagemid or attenuated virus. For a discussion of the 
regulation of gene expression using antisense genes see, e.g., Weintraub et al., Antisense RNA as a 
molecular tool for genetic analysis, Reviews - Trends in Genetics, Vol. 1(1) (1986). 

[0051] Also provided herein are host cells that include a APOL3 nucleotide sequence within a 
recombinant expression vector or a fragment of such a nucleotide sequence which facilitate homologous 
recombination into a specific site of the host cell genome. The terms "host cell" and "recombinant host 
cell" are used interchangeably herein. Such terms refer not only to the particular subject cell but rather 
also to the progeny or potential progeny of such a cell. Because certain modifications may occur in 
succeeding generations due to either mutation or environmental influences, such progeny may not, in 
fact, be identical to the parent cell, but are still included within the scope of the term as used herein. A 
host cell can be any prokaryotic or eukaryotic cell. For example, a target polypeptide can be expressed in 
bacterial cells such as E. coli, insect cells, yeast or mammalian cells (such as Chinese hamster ovary cells 
(CHO) or COS cells). Other suitable host cells are known to those skilled in the art. 

[0052] Vectors can be introduced into host cells via conventional transformation or transfection 
techniques. As used herein, the terms "transformation" and "transfection" are intended to refer to a 
variety of art-recognized techniques for introducing foreign nucleic acid (e.g., DNA) into a host cell, 
including calcium phosphate or calcium chloride co-precipitation, transduction/infection, DEAE-dextran- 
mediated transfection, lipofection, or electroporation. 

[0053] A host cell provided herein can be used to produce (i.e., express) a target polypeptide or a 
substantially identical polypeptide thereof. Accordingly, further provided are methods for producing a 
target polypeptide using host cells described herein. In one embodiment, the method includes culturing 
host cells into which a recombinant expression vector encoding a target polypeptide has been introduced 
in a suitable medium such that a target polypeptide is produced. In another embodiment, the method 
further includes isolating a target polypeptide from the medium or the host cell. 

[0054] Also provided are cells or purified preparations of cells which include a APOL3 transgene, or 
which otherwise misexpress target polypeptide. Cell preparations can consist of human or non-human 
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cells, e.g., rodent cells, e.g., mouse or rat cells, rabbit cells, or pig cells. In preferred embodiments, the 
cell or cells include a APOL3 transgene {e.g. , a heterologous form of a APOL3 gene, such as a human 
gene expressed in non-human cells). The transgene can be misexpressed, e.g., overexpressed or 
underexpressed. In other preferred embodiments, the cell or cells include a gene which misexpress an 
endogenous target polypeptide (e.g., expression of a gene is disrupted, also known as a knockout). Such 
cells can serve as a model for studying disorders which are related to mutated or mis-expressed alleles or 
for use in drug screening. Also provided are human cells (e.g., a hematopoietic stem cells) transfected 
with a APOL3 nucleic acid. 

[0055] Also provided are cells or a purified preparation thereof (e.g. , human cells) in which an 
endogenous APOL3 nucleic acid is under the control of a regulatory sequence that does not normally 
control the expression of the endogenous gene. The expression characteristics of an endogenous gene 
within a cell (e.g., a cell line or microorganism) can be modified by inserting a heterologous DNA 
regulatory element into the genome of the cell such that the inserted regulatory element is operably 
linked to the corresponding endogenous gene. For example, an endogenous corresponding gene (e.g. , a 
gene which is "transcriptionally silent," not normally expressed, or expressed only at very low levels) 
may be activated by inserting a regulatory element which is capable of promoting the expression of a 
normally expressed gene product in that cell. Techniques such as targeted homologous recombinations, 
can be used to insert the heterologous DNA as described in, e.g., Chappel, US 5,272,071 ; WO 91/06667, 
published on May 16, 1991. 

Transgenic Animals 

[0056] Non-human transgenic animals that express a heterologous target polypeptide (e.g., 
expressed from a APOL3 nucleic acid or substantially identical sequence thereof) can be generated. Such 
animals are useful for studying the function and/or activity of a target polypeptide and for identifying 
and/or evaluating modulators of the activity of APOL3 nucleic acids and encoded polypeptides. As used 
herein, a "transgenic animal" is a non-human animal such as a mammal (e.g. , a non-human primate such 
as chimpanzee, baboon, or macaque; an ungulate such as an equine, bovine, or caprine; or a rodent such 
as a rat, a mouse, or an Israeli sand rat), a bird (e.g., a chicken or a turkey), an amphibian (e.g., a frog, 
salamander, or newt), or an insect (e.g., Drosophila melanogaster), in which one or more of the cells of 
the animal includes a transgene. A transgene is exogenous DNA or a rearrangement (e.g., a deletion of 
endogenous chromosomal DNA) that is often integrated into or occurs in the genome of cells in a 
transgenic animal. A transgene can direct expression of an encoded gene product in one or more cell 
types or tissues of the transgenic animal, and other transgenes can reduce expression (e.g., a knockout). 
Thus, a transgenic animal can be one in which an endogenous nucleic acid homologous to a APOL3 
nucleic acid has been altered by homologous recombination between the endogenous gene and an 
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exogenous DNA molecule introduced into a cell of the animal (e.g. , an embryonic cell of the animal) 
prior to development of the animal. 

[0057] Intronic sequences and polyadenylation signals can also be included in the transgene to 
increase expression efficiency of the transgene. One or more tissue-specific regulatory sequences can be 
operably linked to a APOL3 nucleotide sequence to direct expression of an encoded polypeptide to 
particular cells. A transgenic founder animal can be identified based upon the presence of a APOL3 
nucleotide sequence in its genome and/or expression of encoded mRNA in tissues or cells of the animals. 
A transgenic founder animal can then be used to breed additional animals carrying the transgene. 
Moreover, transgenic animals carrying a APOL3 nucleotide sequence can further be bred to other 
transgenic animals carrying other transgenes. 

[0058] Target polypeptides can be expressed in transgenic animals or plants by introducing, for 
example, a APOL3 nucleic acid into the genome of an animal that encodes the target polypeptide. In 
preferred embodiments the nucleic acid is placed under the control of a tissue specific promoter, e.g., a 
milk or egg specific promoter, and recovered from the milk or eggs produced by the animal. Also 
included is a population of cells from a transgenic animal. 

Target Polypeptides 

[0059] Also featured herein are isolated target polypeptides, which are encoded by a APOL3 
nucleotide sequence (e.g., SEQ ID NO: 1-7), or a substantially identical nucleotide sequence thereof. 
Examples of APOL3 polypeptides are set forth in SEQ ID NO: 8-10. The term "polypeptide" as used 
herein includes proteins and peptides. An "isolated" or "purified" polypeptide or protein is substantially 
free of cellular material or other contaminating proteins from the cell or tissue source from which the 
protein is derived, or substantially free from chemical precursors or other chemicals when chemically 
synthesized. In one embodiment, the language "substantially free" means preparation of a target 
polypeptide having less than about 30%, 20%, 10% and more preferably 5% (by dry weight), of non- 
target polypeptide (also referred to herein as a "contaminating protein"), or of chemical precursors or 
non-target chemicals. When the target polypeptide or a biologically active portion thereof is 
recombinantly produced, it is also preferably substantially free of culture medium, specifically, where 
culture medium represents less than about 20%, sometimes less than about 1 0%, and often less than 
about 5% of the volume of the polypeptide preparation. Isolated or purified target polypeptide 
preparations are sometimes 0.01 milligrams or more or 0. 1 milligrams or more, and often 1 .0 milligrams 
or more and 1 0 milligrams or more in dry weight. In certain embodiments, the APOL3 polypeptide or 
polypeptide fragment has APOL3 biological activity, for example, apolipoprotein activity. 

[0060] Further included herein are target polypeptide fragments. The polypeptide fragment may be 
a domain or part of a domain of a target polypeptide. The polypeptide fragment may have increased, 
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decreased or unexpected biological activity. The polypeptide fragment is often 50 or fewer, 1 00 or fewer, 
or 200 or fewer amino acids in length, and is sometimes 300, 400, 500, 600, 700, or 900 or fewer amino 
acids in length. The polypeptide fragment sometimes is amino acids 90-396; amino acids 19-325; or amino 
acids 1-196. 

[0061] Substantially identical target polypeptides may depart from the amino acid sequences of 
target polypeptides in different manners. For example, conservative amino acid modifications may be 
introduced at one or more positions in the amino acid sequences of target polypeptides. A "conservative 
amino acid substitution" is one in which the amino acid is replaced by another amino acid having a 
similar structure and/or chemical function. Families of amino acid residues having similar structures and 
functions are well known. These families include amino acids with basic side chains {e.g. , lysine, 
arginine, histidine), acidic side chains (e.g. , aspartic acid, glutamic acid), uncharged polar side chains 
(e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains (e.g., 
alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side 
chains (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, 
tryptophan, histidine). Also, essential and non-essential amino acids may be replaced. A "non-essential" 
amino acid is one that can be altered without abolishing or substantially altering the biological function 
of a target polypeptide, whereas altering an "essential" amino acid abolishes or substantially alters the 
biological function of a target polypeptide. Amino acids that are conserved among target polypeptides 
are typically essential amino acids. In certain embodiments, the polypeptide includes one or more non- 
synonymous polymorphic variants associated with osteoarthritis. 

[0062] Also, target polypeptides may exist as chimeric or fusion polypeptides. As used herein, a 
target "chimeric polypeptide" or target "fusion polypeptide" includes a target polypeptide linked to a 
non-target polypeptide. A "non-target polypeptide" refers to a polypeptide having an amino acid 
sequence corresponding to a polypeptide which is not substantially identical to the target polypeptide, 
which includes, for example, a polypeptide that is different from the target polypeptide and derived from 
the same or a different organism. The target polypeptide in the fusion polypeptide can correspond to an 
entire or nearly entire target polypeptide or a fragment thereof. The non-target polypeptide can be fused 
to the N-terminus or C-terminus of the target polypeptide. 

[0063] Fusion polypeptides can include a moiety having high affinity for a ligand. For example, the 
fusion polypeptide can be a GST-target fusion polypeptide in which the target sequences are fused to the 
C-terminus of the GST sequences, or a polyhistidine-target fusion polypeptide in which the target 
polypeptide is fused at the N- or C-terminus to a string of histidine residues. Such fusion polypeptides 
can facilitate purification of recombinant target polypeptide. Expression vectors are commercially 
available that already encode a fusion moiety (e.g., a GST polypeptide), and a nucleotide sequence in 
SEQ ID NO: 1 -7, or a substantially identical nucleotide sequence thereof, can be cloned into an 
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expression vector such that the fusion moiety is linked in-frame to the target polypeptide. Further, the 
fusion polypeptide can be a target polypeptide containing a heterologous signal sequence at its N- 
terminus. In certain host cells (e.g. , mammalian host cells), expression, secretion, cellular internalization, 
and cellular localization of a target polypeptide can be increased through use of a heterologous signal 
sequence. Fusion polypeptides can also include all or a part of a serum polypeptide (e.g., an IgG constant 
region or human serum albumin). 

[0064] Target polypeptides can be incorporated into pharmaceutical compositions and administered 
to a subject in vivo. Administration of these target polypeptides can be used to affect the bioavailability 
of a substrate of the target polypeptide and may effectively increase target polypeptide biological activity 
in a cell. Target fusion polypeptides may be useful therapeutically for the treatment of disorders caused 
by, for example, (i) aberrant modification or mutation of a gene encoding a target polypeptide; (ii) mis- 
regulation of the gene encoding the target polypeptide; and (iii) aberrant post-translational modification 
of a target polypeptide. Also, target polypeptides can be used as immunogens to produce anti-target 
antibodies in a subject, to purify target polypeptide ligands or binding partners, and in screening assays to 
identify molecules which inhibit or enhance the interaction of a target polypeptide with a substrate. 

[0065] In addition, polypeptides can be chemically synthesized using techniques known in the art 
(See, e.g., Creighton, 1983 Proteins. New York, N.Y.: W. H. Freeman and Company; and Hunkapiller et 
al., (1984) Nature July 12 -18;310(5973):105-1 1). For example, a relative short fragment can be 
synthesized by use of a peptide synthesizer. Furthermore, if desired, non-classical amino acids or 
chemical amino acid analogs can be introduced as a substitution or addition into the fragment sequence. 
Non-classical amino acids include, but are not limited to, to the D-isomers of the common amino acids, 
2,4-diaminobutyric acid, a-amino isobutyric acid, 4-aminobutyric acid, Abu, 2-amino butyric acid, g- 
Abu, e-Ahx, 6-amino hexanoic acid, Aib, 2-amino isobutyric acid, 3-amino propionic acid, ornithine, 
norleucine, norvaline, hydroxyproline, sarcosine, citrulline, homocitrulline, cysteic acid, t-butylglycine, t- 
butylalanine, phenylglycine, cyclohexylalanine, b-alanine, fluoroamino acids, designer amino acids such 
as b-methyl amino acids, Ca-methyl amino acids, Na-methyl amino acids, and amino acid analogs in 
general. Furthermore, the amino acid can be D (dextrorotary) or L (levorotary). 

[0066] Polypeptides and polypeptide fragments sometimes are differentially modified during or after 
translation, e.g. , by glycosylation, acetylation, phosphorylation, amidation, derivatization by known 
protecting/blocking groups, proteolytic cleavage, linkage to an antibody molecule or other cellular ligand, 
etc. Any of numerous chemical modifications may be carried out by known techniques, including but not 
limited, to specific chemical cleavage by cyanogen bromide, trypsin, chymotrypsin, papain, V8 protease, 
NaBH4; acetylation, formylation, oxidation, reduction; metabolic synthesis in the presence of 
tunicamycin; and the like. Additional post-translational modifications include, for example, N-linked or 
O-linked carbohydrate chains, processing of N-terminal or C-terminal ends), attachment of chemical 

17 



PATENT 
Docket 524593009200 



moieties to the amino acid backbone, chemical modifications of N-linked or O-linked carbohydrate 
chains, and addition or deletion of an N-terminal methionine residue as a result of prokaryotic host cell 
expression. The polypeptide fragments may also be modified with a detectable label, such as an 
enzymatic, fluorescent, isotopic or affinity label to allow for detection and isolation of the polypeptide. 

[0067] Also provided are chemically modified derivatives of polypeptides that can provide 
additional advantages such as increased solubility, stability and circulating time of the polypeptide, or 
decreased immunogenicity (see e.g., U.S. Pat. No: 4,179,337. The chemical moieties for derivitization 
may be selected from water soluble polymers such as polyethylene glycol, ethylene glycol/propylene 
glycol copolymers, carboxymethylcellulose, dextran, polyvinyl alcohol and the like. The polypeptides 
may be modified at random positions within the molecule, or at predetermined positions within the 
molecule and may include one, two, three or more attached chemical moieties. 

[0068] The polymer may be of any molecular weight, and may be branched or unbranched. For 
polyethylene glycol, the preferred molecular weight is between about 1 kDa and about 1 00 kDa (the term 
"about" indicating that in preparations of polyethylene glycol, some molecules will weigh more, some 
less, than the stated molecular weight) for ease in handling and manufacturing. Other sizes may be used, 
depending on the desired therapeutic profile (e.g., the duration of sustained release desired, the effects, if 
any on biological activity, the ease in handling, the degree or lack of antigenicity and other known effects 
of the polyethylene glycol to a therapeutic protein or analog). 

[0069] The polymers should be attached to the polypeptide with consideration of effects on 
functional or antigenic domains of the polypeptide. There are a number of attachment methods available 
to those skilled in the art (e.g., EP 0 401 384 (coupling PEG to G-CSF) and Malik et al. (1992) Exp 
Hematol. September;20(8): 1028-35 (pegylation of GM-CSF using tresyl chloride)). For example, 
polyethylene glycol may be covalently bound through amino acid residues via a reactive group, such as a 
free amino or carboxyl group. Reactive groups are those to which an activated polyethylene glycol 
molecule may be bound. The amino acid residues having a free amino group may include lysine residues 
and the N-terminal amino acid residues; those having a free carboxyl group may include aspartic acid 
residues, glutamic acid residues and the C-terminal amino acid residue. Sulfhydryl groups may also be 
used as a reactive group for attaching the polyethylene glycol molecules. For therapeutic purposes, the 
attachment sometimes is at an amino group, such as attachment at the N-terminus or lysine group. 

[0070] Proteins can be chemically modified at the N-terminus. Using polyethylene glycol as an 
illustration of such a composition, one may select from a variety of polyethylene glycol molecules (by 
molecular weight, branching, and the like), the proportion of polyethylene glycol molecules to protein 
(polypeptide) molecules in the reaction mix, the type of pegylation reaction to be performed, and the 
method of obtaining the selected N-terminal ly pegylated protein. The method of obtaining the N- 
terminally pegylated preparation (i.e., separating this moiety from other monopegylated moieties if 
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necessary) may be by purification of the N-terminally pegylated material from a population of pegylated 
protein molecules. Selective proteins chemically modified at the N-terminus may be accomplished by 
reductive alkylation, which exploits differential reactivity of different types of primary amino groups 
(lysine versus the N-terminal) available for derivatization in a particular protein. Under the appropriate 
reaction conditions, substantially selective derivatization of the protein at the N-terminus with a carbonyl 
group containing polymer is achieved. 

Substantially Identical Nucleic Acids and Polypeptides 

[0071] Nucleotide sequences and polypeptide sequences that are substantially identical to a APOL3 
nucleotide sequence and the target polypeptide sequences encoded by those nucleotide sequences, 
respectively, are included herein. The term "substantially identical" as used herein refers to two or more 
nucleic acids or polypeptides sharing one or more identical nucleotide sequences or polypeptide 
sequences, respectively. Included are nucleotide sequences or polypeptide sequences that are 55% or 
more, 60% or more, 65% or more, 70% or more, 75% or more, 80% or more, 85% or more, 90% or more, 
95% or more (each often within a 1%, 2%, 3% or 4% variability) identical to a APOL3 nucleotide 
sequence or the encoded target polypeptide amino acid sequences. One test for determining whether two 
nucleic acids are substantially identical is to determine the percent of identical nucleotide sequences or 
polypeptide sequences shared between the nucleic acids or polypeptides. 

[0072] Calculations of sequence identity are often performed as follows. Sequences are aligned for 
optimal comparison purposes (e.g., gaps can be introduced in one or both of a first and a second amino 
acid or nucleic acid sequence for optimal alignment and non-homologous sequences can be disregarded 
for comparison purposes). The length of a reference sequence aligned for comparison purposes is 
sometimes 30% or more, 40% or more, 50% or more, often 60% or more, and more often 70% or more, 
80% or more, 90% or more, or 100% of the length of the reference sequence. The nucleotides or amino 
acids at corresponding nucleotide or polypeptide positions, respectively, are then compared among the 
two sequences. When a position in the first sequence is occupied by the same nucleotide or amino acid 
as the corresponding position in the second sequence, the nucleotides or amino acids are deemed to be 
identical at that position. The percent identity between the two sequences is a function of the number of 
identical positions shared by the sequences, taking into account the number of gaps, and the length of 
each gap, introduced for optimal alignment of the two sequences. 

[0073] Comparison of sequences and determination of percent identity between two sequences can 
be accomplished using a mathematical algorithm. Percent identity between two amino acid or nucleotide 
sequences can be determined using the algorithm of Meyers & Miller, CABIOS 4: 11-17 (1989), which 
has been incorporated into the ALIGN program (version 2.0), using a PAM120 weight residue table, a 
gap length penalty of 12 and a gap penalty of 4. Also, percent identity between two amino acid 
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sequences can be determined using the Needleman & Wunsch, J. Mol. Biol. 48: 444-453 (1970) 
algorithm which has been incorporated into the GAP program in the GCG software package (available at 
the http address www.gcg.com), using either a Blossum 62 matrix or a PAM250 matrix, and a gap weight 
of 16, 14, 12, 10, 8, 6, or 4 and a length weight of 1, 2, 3, 4, 5, or 6. Percent identity between two 
nucleotide sequences can be determined using the GAP program in the GCG software package (available 
at http address www.gcg.com), using a NWSgapdna.CMP matrix and a gap weight of 40, 50, 60, 70, or 
80 and a length weight of 1, 2, 3, 4, 5, or 6. A set of parameters often used is a Blossum 62 scoring 
matrix with a gap open penalty of 12, a gap extend penalty of 4, and a frameshift gap penalty of 5. 

[0074] Another manner for determining if two nucleic acids are substantially identical is to assess 
whether a polynucleotide homologous to one nucleic acid will hybridize to the other nucleic acid under 
stringent conditions. As use herein, the term "stringent conditions" refers to conditions for hybridization 
and washing. Stringent conditions are known to those skilled in the art and can be found in Current 
Protocols in Molecular Biology, John Wiley & Sons, N.Y. , 6.3.1-6.3.6 (1989). Aqueous and non- 
aqueous methods are described in that reference and either can be used. An example of stringent 
hybridization conditions is hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45°C, 
followed by one or more washes in 0.2X SSC, 0.1% SDS at 50°C. Another example of stringent 
hybridization conditions are hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45°C, 
followed by one or more washes in 0.2X SSC, 0.1% SDS at 55°C. A further example of stringent 
hybridization conditions is hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45°C, 
followed by one or more washes in 0.2X SSC, 0.1% SDS at 60°C. Often, stringent hybridization 
conditions are hybridization in 6X sodium chloride/sodium citrate (SSC) at about 45°C, followed by one 
or more washes in 0.2X SSC, 0.1% SDS at 65°C. More often, stringency conditions are 0.5M sodium 
phosphate, 7% SDS at 65°C, followed by one or more washes at 0.2X SSC, 1% SDS at 65°C. 

[0075] An example of a substantially identical nucleotide sequence to a nucleotide sequence in SEQ 
ID NO: 1-7 is one that has a different nucleotide sequence but still encodes the same polypeptide 
sequence encoded by the nucleotide sequence in SEQ ID NO: 1-7. Another example is a nucleotide 
sequence that encodes a polypeptide having a polypeptide sequence that is more than 70% or more 
identical to, sometimes more than 75% or more, 80% or more, or 85% or more identical to, and often 
more than 90% or more and 95% or more identical to a polypeptide sequence encoded by a nucleotide 
sequence in SEQ ID NO: 1-7. 

[0076] Nucleotide sequences in SEQ ID NO: 1 -7 and amino acid sequences of encoded polypeptides 
can be used as "query sequences" to perform a search against public databases to identify other family 
members or related sequences, for example. Such searches can be performed using the NBLAST and 
XBLAST programs (version 2.0) of Altschul et al, J. Mol. Biol. 215: 403-10 (1990). BLAST nucleotide 
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searches can be performed with the NBLAST program, score = 100, wordlength = 12 to obtain 
nucleotide sequences homologous to nucleotide sequences in SEQ ID NO: 1-7. BLAST polypeptide 
searches can be performed with the XBLAST program, score = 50, wordlength = 3 to obtain amino acid 
sequences homologous to polypeptides encoded by the nucleotide sequences of SEQ ID NO: 1 -7. To 
obtain gapped alignments for comparison purposes, Gapped BLAST can be utilized as described in 
Altschul et al, Nucleic Acids Res. 25(1 7): 3389-3402 (1 997). When utilizing BLAST and Gapped 
BLAST programs, default parameters of the respective programs {e.g., XBLAST and NBLAST) can be 
used {see the http address www.ncbi.nlm.nih.gov). 

[0077] A nucleic acid that is substantially identical to a nucleotide sequence in SEQ ID NO: 1-7 
may include polymorphic sites at positions equivalent to those described herein when the sequences are 
aligned. For example, using the alignment procedures described herein, SNPs in a sequence substantially 
identical to a sequence in SEQ ED NO: 1-7 can be identified at nucleotide positions that match {i.e., align) 
with nucleotides at SNP positions in each nucleotide sequence in SEQ ED NO: 1-7. Also, where a 
polymorphic variation results in an insertion or deletion, insertion or deletion of a nucleotide sequence 
from a reference sequence can change the relative positions of other polymorphic sites in the nucleotide 
sequence. 

[00781 Substantially identical nucleotide and polypeptide sequences include those that are naturally 
occurring, such as allelic variants (same locus), splice variants, homologs (different locus), and orthologs 
(different organism) or can be non-naturally occurring. Non-naturally occurring variants can be 
generated by mutagenesis techniques, including those applied to polynucleotides, cells, or organisms. 
The variants can contain nucleotide substitutions, deletions, inversions and insertions. Variation can 
occur in either or both the coding and non-coding regions. The variations can produce both conservative 
and non-conservative amino acid substitutions (as compared in the encoded product). Orthologs, 
homologs, allelic variants, and splice variants can be identified using methods known in the art. These 
variants normally comprise a nucleotide sequence encoding a polypeptide that is 50% or more, about 
55% or more, often about 70-75% or more or about 80-85% or more, and sometimes about 90-95% or 
more identical to the amino acid sequences of target polypeptides or a fragment thereof. Such nucleic 
acid molecules can readily be identified as being able to hybridize under stringent conditions to a 
nucleotide sequence in SEQ ED NO: 1 or a fragment of this sequence. Nucleic acid molecules 
corresponding to orthologs, homologs, and allelic variants of a nucleotide sequence in SEQ ED NO: 1-7 
can further be identified by mapping the sequence to the same chromosome or locus as the nucleotide 
sequence in SEQ ED NO: 1-7. 

[0079] Also, substantially identical nucleotide sequences may include codons that are altered with 
respect to the naturally occurring sequence for enhancing expression of a target polypeptide in a 
particular expression system. For example, the nucleic acid can be one in which one or more codons are 
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altered, and often 10% or more or 20% or more of the codons are altered for optimized expression in 
bacteria (e.g., E. coli.), yeast (e.g., S. cervesiae), human (e.g., 293 cells), insect, or rodent (e.g., hamster) 
cells. 

Methods for Identifying Risk of osteoarthritis 

[0080] Methods for prognosing and diagnosing osteoarthritis are included herein. These methods 
include detecting the presence or absence of one or more polymorphic variations in a nucleotide sequence 
associated with osteoarthritis, such as variants in or around the loci set forth herein, or a substantially 
identical sequence thereof, in a sample from a subject, where the presence of a polymorphic variant 
described herein is indicative of a risk of osteoarthritis. Determining a risk of osteoarthritis sometimes 
refers to determining whether an individual is at an increased risk of osteoarthritis (e.g., intermediate risk 
or higher risk). 

[0081] Thus, featured herein is a method for identifying a subject who is at risk of osteoarthritis, 
which comprises detecting an aberration associated with osteoarthritis in a nucleic acid sample from the 
subject. An embodiment is a method for detecting a risk of osteoarthritis in a subject, which comprises 
detecting the presence or absence of a polymorphic variation associated with osteoarthritis at a 
polymorphic site in a nucleotide sequence in a nucleic acid sample from a subject, where the nucleotide 
sequence comprises a polynucleotide sequence selected from the group consisting of: (a) a nucleotide 
sequence of SEQ ID NO: 1 -7; (b) a nucleotide sequence which encodes a polypeptide consisting of an 
amino acid sequence encoded by a nucleotide sequence of SEQ ID NO: 1-7; (c) a nucleotide sequence 
which encodes a polypeptide that is 90% or more identical to an amino acid sequence encoded by a 
nucleotide sequence of SEQ ID NO: 1-7, or a nucleotide sequence about 90% or more identical to a 
nucleotide sequence of SEQ ID NO: 1-7; and (d) a fragment of a nucleotide sequence of (a), (b), or (c) 
comprising the polymorphic site; whereby the presence of the polymorphic variation is indicative of a 
predisposition to osteoarthritis in the subject. In certain embodiments, polymorphic variants at the 
positions described herein are detected for determining a risk of osteoarthritis, and polymorphic variants 
at positions in linkage disequilibrium with these positions are detected for determining a risk of 
osteoarthritis. As used herein, "SEQ ID NO: 1-7" refers to individual sequences in SEQ ID NO: 1, 2, 3, 
4, 5, 6 or 7, each sequence being separately applicable to embodiments described herein. 

[0082] Risk of osteoarthritis sometimes is expressed as a probability, such as an odds ratio, 
percentage, or risk factor. Risk often is based upon the presence or absence of one or more polymorphic 
variants described herein, and also may be based in part upon phenotypic traits of the individual being 
tested. Methods for calculating risk based upon patient data are well known (see, e.g., Agresti, 
Categorical Data Analysis, 2nd Ed. 2002. Wiley). Allelotyping and genotyping analyses may be carried 
out in populations other than those exemplified herein to enhance the predictive power of the prognostic 
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method. These further analyses are executed in view of the exemplified procedures described herein, and 
may be based upon the same polymorphic variations or additional polymorphic variations. 

[0083] In certain embodiments, determining the presence of a combination of two or more 
polymorphic variants associated with osteoarthritis in one or more genetic loci (e.g., one or more genes) 
of the sample is determined to identify, quantify and/or estimate, risk of osteoarthritis. The risk often is 
the probability of having or developing osteoarthritis. The risk sometimes is expressed as a relative risk 
with respect to a population average risk of osteoarthritis, and sometimes is expressed as a relative risk 
with respect to the lowest risk group. Such relative risk assessments often are based upon penetrance 
values determined by statistical methods, and are particularly useful to clinicians and insurance 
companies for assessing risk of osteoarthritis (e.g., a clinician can target appropriate detection, prevention 
and therapeutic regimens to a patient after determining the patient's risk of osteoarthritis, and an 
insurance company can fine tune actuarial tables based upon population genotype assessments of 
osteoarthritis risk). Risk of osteoarthritis sometimes is expressed as an odds ratio, which is the odds of a 
particular person having a genotype has or will develop osteoarthritis with respect to another genotype 
group (e.g., the most disease protective genotype or population average). In related embodiments, the 
determination is utilized to identify a subject at risk of osteoarthritis. In an embodiment, two or more 
polymorphic variations are detected in two or more regions in human genomic DNA associated with 
increased risk of osteoarthritis, such as a locus containing a APOL3, for example. In certain 
embodiments, 3 or more, or 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16, 17, 18, 19 or 20 or more 
polymorphic variants are detected in the sample. In specific embodiments, polymorphic variants are 
detected in a APOL3 region, for example. In certain embodiments, polymorphic variants are detected at 
other genetic loci (e.g., the polymorphic variants can be detected in APOL3 in addition to other loci or 
only in other loci), where the other loci include but are not limited to those described in concurrently- 
filed patent applications having attorney docket number 524593008700, 524593008800, 524593008900, 
524593009000 or 524593009100, each of which is incorporated herein by reference in its entirety. 

[0084] Results from prognostic tests may be combined with other test results to diagnose 
osteoarthritis. For example, prognostic results may be gathered, a patient sample may be ordered based 
on a determined predisposition to osteoarthritis, the patient sample is analyzed, and the results of the 
analysis may be utilized to diagnose osteoarthritis. Also osteoarthritis diagnostic method can be 
developed from studies used to generate prognostic methods in which populations are stratified into 
subpopulations having different progressions of osteoarthritis. In another embodiment, prognostic results 
may be gathered, a patient's risk factors for developing osteoarthritis (e.g., age, weight, race, diet) 
analyzed, and a patient sample may be ordered based on a determined predisposition to osteoarthritis. 

[0085] The nucleic acid sample typically is isolated from a biological sample obtained from a 
subject. For example, nucleic acid can be isolated from blood, saliva, sputum, urine, cell scrapings, and 
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biopsy tissue. The nucleic acid sample can be isolated from a biological sample using standard 
techniques, such as the technique described in Example 2. As used herein, the term "subject" refers 
primarily to humans but also refers to other mammals such as dogs, cats, and ungulates (e.g., cattle, 
sheep, and swine). Subjects also include avians (e.g., chickens and turkeys), reptiles, and fish (e.g., 
salmon), as embodiments described herein can be adapted to nucleic acid samples isolated from any of 
these organisms. The nucleic acid sample may be isolated from the subject and then directly utilized in a 
method for determining the presence of a polymorphic variant, or alternatively, the sample may be 
isolated and then stored (e.g., frozen) for a period of time before being subjected to analysis. 

[0086] The presence or absence of a polymorphic variant is determined using one or both 
chromosomal complements represented in the nucleic acid sample. Determining the presence or absence 
of a polymorphic variant in both chromosomal complements represented in a nucleic acid sample from a 
subject having a copy of each chromosome is useful for determining the zygosity of an individual for the 
polymorphic variant (i.e., whether the individual is homozygous or heterozygous for the polymorphic 
variant). Any oligonucleotide-based diagnostic may be utilized to determine whether a sample includes 
the presence or absence of a polymorphic variant in a sample. For example, primer extension methods, 
ligase sequence determination methods (e.g., U.S. Pat. Nos. 5,679,524 and 5,952,174, and WO 
01/27326), mismatch sequence determination methods (e.g., U.S. Pat. Nos. 5,851,770; 5,958,692; 
6,1 10,684; and 6,183,958), microarray sequence determination methods, restriction fragment length 
polymorphism (RFLP), single strand conformation polymorphism detection (SSCP) (e.g., U.S. Pat. Nos. 
5,891,625 and 6,013,499), PCR-based assays (e.g., TAQMAN® PCR System (Applied Biosystems)), and 
nucleotide sequencing methods may be used. 

[0087] Oligonucleotide extension methods typically involve providing a pair of oligonucleotide 
primers in a polymerase chain reaction (PCR) or in other nucleic acid amplification methods for the 
purpose of amplifying a region from the nucleic acid sample that comprises the polymorphic variation. 
One oligonucleotide primer is complementary to a region 3 ' of the polymorphism and the other is 
complementary to a region 5 ' of the polymorphism. A PCR primer pair may be used in methods 
disclosed in U.S. Pat. Nos. 4,683,195; 4,683,202, 4,965,188; 5,656,493; 5,998,143; 6,140,054; WO 
01/27327; and WO 01/27329 for example. PCR primer pairs may also be used in any commercially 
available machines that perform PCR, such as any of the GENEAMP® Systems available from Applied 
Biosystems. Also, those of ordinary skill in the art will be able to design oligonucleotide primers based 
upon a APOL3 nucleotide sequence using knowledge available in the art. 

[0088] Also provided is an extension oligonucleotide that hybridizes to the amplified fragment 
adjacent to the polymorphic variation. As used herein, the term "adjacent" refers to the 3' end of the 
extension oligonucleotide being often 1 nucleotide from the 5 ' end of the polymorphic site, and 
sometimes 2, 3, 4, 5, 6, 7, 8, 9, or 10 nucleotides from the 5' end of the polymorphic site, in the nucleic 
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acid when the extension oligonucleotide is hybridized to the nucleic acid. The extension oligonucleotide 
then is extended by one or more nucleotides, and the number and/or type of nucleotides that are added to 
the extension oligonucleotide determine whether the polymorphic variant is present. Oligonucleotide 
extension methods are disclosed, for example, in U.S. Pat. Nos. 4,656,127; 4,851,331; 5,679,524; 
5,834,189; 5,876,934; 5,908,755; 5,912,118; 5,976,802; 5,981,186; 6,004,744; 6,013,431; 6,017,702; 
6,046,005; 6,087,095; 6,210,891; and WO 01/20039. Oligonucleotide extension methods using mass 
spectrometry are described, for example, in U.S. Pat. Nos. 5,547,835; 5,605,798; 5,691,141; 5,849,542; 
5,869,242; 5,928,906; 6,043,031; and 6,194,144, and a method often utilized is described herein in 
Example 2. 

[0089] A microarray can be utilized for determining whether a polymorphic variant is present or 
absent in a nucleic acid sample. A microarray may include any oligonucleotides described herein, and 
methods for making and using oligonucleotide microarrays suitable for diagnostic use are disclosed in 
U.S. Pat. Nos. 5,492,806; 5,525,464; 5,589,330; 5,695,940; 5,849,483; 6,018,041; 6,045,996; 6,136,541; 
6,142,681; 6,156,501; 6,197,506; 6,223,127; 6,225,625; 6,229,91 1; 6,239,273; WO 00/52625; WO 
01/25485; and WO 01/29259. The microarray typically comprises a solid support and the 
oligonucleotides may be linked to this solid support by covalent bonds or by non-covalent interactions. 
The oligonucleotides may also be linked to the solid support directly or by a spacer molecule. A 
microarray may comprise one or more oligonucleotides complementary to a polymorphic site set forth 
herein. 

[0090] A kit also may be utilized for determining whether a polymorphic variant is present or absent 
in a nucleic acid sample. A kit often comprises one or more pairs of oligonucleotide primers useful for 
amplifying a fragment of a nucleotide sequence of SEQ ID NO: 1-7 or a substantially identical sequence 
thereof, where the fragment includes a polymorphic site. The kit sometimes comprises a polymerizing 
agent, for example, a thermostable nucleic acid polymerase such as one disclosed in U.S. Pat. Nos. 
4,889,818 or 6,077,664. Also, the kit often comprises an elongation oligonucleotide that hybridizes to a 
APOL3 nucleotide sequence in a nucleic acid sample adjacent to the polymorphic site. Where the kit 
includes an elongation oligonucleotide, it also often comprises chain elongating nucleotides, such as 
dATP, dTTP, dGTP, dCTP, and dITP, including analogs of dATP, dTTP, dGTP, dCTP and dITP, 
provided that such analogs are substrates for a thermostable nucleic acid polymerase and can be 
incorporated into a nucleic acid chain elongated from the extension oligonucleotide. Along with chain 
elongating nucleotides would be one or more chain terminating nucleotides such as ddATP, ddTTP, 
ddGTP, ddCTP, and the like. In an embodiment, the kit comprises one or more oligonucleotide primer 
pairs, a polymerizing agent, chain elongating nucleotides, at least one elongation oligonucleotide, and 
one or more chain terminating nucleotides. Kits optionally include buffers, vials, microtiter plates, and 
instructions for use. 
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[0091] An individual identified as being at risk of osteoarthritis may be heterozygous or 
homozygous with respect to the allele associated with a higher risk of osteoarthritis. A subject 
homozygous for an allele associated with an increased risk of osteoarthritis is at a comparatively high 
risk of osteoarthritis, a subject heterozygous for an allele associated with an increased risk of 
osteoarthritis is at a comparatively intermediate risk of osteoarthritis, and a subject homozygous for an 
allele associated with a decreased risk of osteoarthritis is at a comparatively low risk of osteoarthritis. A 
genotype may be assessed for a complementary strand, such that the complementary nucleotide at a 
particular position is detected. 

[0092] Also featured are methods for determining risk of osteoarthritis and/or identifying a subject 
at risk of osteoarthritis by contacting a polypeptide or protein encoded by a APOL3 nucleotide sequence 
from a subject with an antibody that specifically binds to an epitope associated with increased risk of 
osteoarthritis in the polypeptide. 

Applications of Prognostic and Diagnostic Results to Pharmacogenomic Methods 
[00931 Pharmacogenomics is a discipline that involves tailoring a treatment for a subject according 
to the subject's genotype as a particular treatment regimen may exert a differential effect depending upon 
the subject's genotype. For example, based upon the outcome of a prognostic test described herein, a 
clinician or physician may target pertinent information and preventative or therapeutic treatments to a 
subject who would be benefited by the information or treatment and avoid directing such information and 
treatments to a subject who would not be benefited {e.g., the treatment has no therapeutic effect and/or 
the subject experiences adverse side effects). 

[0094] The following is an example of a pharmacogenomic embodiment. A particular treatment 
regimen can exert a differential effect depending upon the subject's genotype. Where a candidate 
therapeutic exhibits a significant interaction with a major allele and a comparatively weak interaction 
with a minor allele (e.g. , an order of magnitude or greater difference in the interaction), such a 
therapeutic typically would not be administered to a subject genotyped as being homozygous for the 
minor allele, and sometimes not administered to a subject genotyped as being heterozygous for the minor 
allele. In another example, where a candidate therapeutic is not significantly toxic when administered to 
subjects who are homozygous for a major allele but is comparatively toxic when administered to subjects 
heterozygous or homozygous for a minor allele, the candidate therapeutic is not typically administered to 
subjects who are genotyped as being heterozygous or homozygous with respect to the minor allele. 

[0095] The methods described herein are applicable to pharmacogenomic methods for preventing, 
alleviating or treating osteoarthritis. For example, a nucleic acid sample from an individual may be 
subjected to a prognostic test described herein. Where one or more polymorphic variations associated 
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with increased risk of osteoarthritis are identified in a subject, information for preventing or treating 
osteoarthritis and/or one or more osteoarthritis treatment regimens then may be prescribed to that subject. 

[0096] In certain embodiments, a treatment or preventative regimen is specifically prescribed and/or 
administered to individuals who will most benefit from it based upon their risk of developing 
osteoarthritis assessed by the methods described herein. Thus, provided are methods for identifying a 
subject predisposed to osteoarthritis and then prescribing a therapeutic or preventative regimen to 
individuals identified as having a predisposition. Thus, certain embodiments are directed to a method for 
reducing osteoarthritis in a subject, which comprises: detecting the presence or absence of a polymorphic 
variant associated with osteoarthritis in a nucleotide sequence in a nucleic acid sample from a subject, 
where the nucleotide sequence comprises a polynucleotide sequence selected from the group consisting 
of: (a) a nucleotide sequence of SEQ ID NO: 1 -7; (b) a nucleotide sequence which encodes a polypeptide 
consisting of an amino acid sequence encoded by a nucleotide sequence of SEQ ID NO: 1 -7; (c) a 
nucleotide sequence which encodes a polypeptide that is 90% or more identical to an amino acid 
sequence encoded by a nucleotide sequence of SEQ ID NO: 1-7, or a nucleotide sequence about 90% or 
more identical to a nucleotide sequence of SEQ ID NO: 1-7; and (d) a fragment of a polynucleotide 
sequence of (a), (b), or (c); and prescribing or administering a treatment regimen to a subject from whom 
the sample originated where the presence of a polymorphic variation associated with osteoarthritis is 
detected in the nucleotide sequence. In these methods, predisposition results may be utilized in 
combination with other test results to diagnose osteoarthritis. 

[0097] Certain preventative treatments often are prescribed to subjects having a predisposition to 
osteoarthritis and where the subject is diagnosed with osteoarthritis or is diagnosed as having symptoms 
indicative of an early stage of osteoarthritis. The treatment sometimes is preventative {e.g., is prescribed 
or administered to reduce the probability that osteoarthritis arises or progresses), sometimes is 
therapeutic, and sometimes delays, alleviates or halts the progression of osteoarthritis. Any known 
preventative or therapeutic treatment for alleviating or preventing the occurrence of osteoarthritis is 
prescribed and/or administered. For example, the treatment often is directed to decreasing pain and 
improving joint movement. Examples of OA treatments include exercises to keep joints flexible and 
improve muscle strength. Different medications to control pain, including corticosteroids and 
nonsteroidal anti-inflammatory drugs (NSAIDs, e.g., Voltaren); cyclooxygenase-2 (COX-2) inhibitors 
(e.g., Celebrex, Vioxx, Mobic, and Bextra); monoclonal antibodies (e.g., Remicade); tumor necrosis 
factor inhibitors (e.g., Enbrel); or injections of glucocorticoids, hyaluronic acid or chondrotin sulfate into 
joints that are inflamed and not responsive to NSAIDS. Orally administered chondroitin sulfate also may 
be used as a therapeutic, as it may increase hyaluronic acid levels and viscosity of synovial fluid, and 
decrease collagenase levels in synovial fluid. Also, glucosamine can serve as an OA therapeutic as 
delivering it into joints may inhibit enzymes involved in cartilage degradation and enhance the 
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production of hyaluronic acid. For mild pain without inflammation, acetaminophen may be used. Other 
treatments include: heat/cold therapy for temporary pain relief; joint protection to prevent strain or stress 
on painful joints; surgery to relieve chronic pain in damaged joints; and weight control to prevent extra 
stress on weight-bearing joints. 

[0098] As therapeutic approaches for treating osteoarthritis continue to evolve and improve, the goal 
of treatments for osteoarthritis related disorders is to intervene even before clinical signs first manifest. 
Thus, genetic markers associated with susceptibility to osteoarthritis prove useful for early diagnosis, 
prevention and treatment of osteoarthritis. 

[0099] As osteoarthritis preventative and treatment information can be specifically targeted to 
subjects in need thereof (e.g., those at risk of developing osteoarthritis or those in an early stage of 
osteoarthritis), provided herein is a method for preventing or reducing the risk of developing 
osteoarthritis in a subject, which comprises: (a) detecting the presence or absence of a polymorphic 
variation associated with osteoarthritis at a polymorphic site in a nucleotide sequence in a nucleic acid 
sample from a subject; (b) identifying a subject with a predisposition to osteoarthritis, whereby the 
presence of the polymorphic variation is indicative of a predisposition to osteoarthritis in the subject; and 
(c) if such a predisposition is identified, providing the subject with information about methods or 
products to prevent or reduce osteoarthritis or to delay the onset of osteoarthritis. Also provided is a 
method of targeting information or advertising to a subpopulation of a human population based on the 
subpopulation being genetically predisposed to a disease or condition, which comprises: (a) detecting the 
presence or absence of a polymorphic variation associated with osteoarthritis at a polymorphic site in a 
nucleotide sequence in a nucleic acid sample from a subject; (b) identifying the subpopulation of subjects 
in which the polymorphic variation is associated with osteoarthritis; and (c) providing information only 
to the subpopulation of subjects about a particular product which may be obtained and consumed or 
applied by the subject to help prevent or delay onset of the disease or condition. 

[0100] Pharmacogenomics methods also may be used to analyze and predict a response to 
osteoarthritis treatment or a drug. For example, if pharmacogenomics analysis indicates a likelihood that 
an individual will respond positively to osteoarthritis treatment with a particular drug, the drug may be 
administered to the individual. Conversely, if the analysis indicates that an individual is likely to respond 
negatively to treatment with a particular drug, an alternative course of treatment may be prescribed. A 
negative response may be defined as either the absence of an efficacious response or the presence of toxic 
side effects. The response to a therapeutic treatment can be predicted in a background study in which 
subjects in any of the following populations are genotyped: a population that responds favorably to a 
treatment regimen, a population that does not respond significantly to a treatment regimen, and a 
population that responds adversely to a treatment regimen (e.g., exhibits one or more side effects). These 
populations are provided as examples and other populations and subpopulations may be analyzed. Based 
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upon the results of these analyses, a subject is genotyped to predict whether he or she will respond 
favorably to a treatment regimen, not respond significantly to a treatment regimen, or respond adversely 
to a treatment regimen. 

[0101] The tests described herein also are applicable to clinical drug trials. One or more 
polymorphic variants indicative of response to an agent for treating osteoarthritis or to side effects to an 
agent for treating osteoarthritis may be identified using the methods described herein. Thereafter, 
potential participants in clinical trials of such an agent may be screened to identify those individuals most 
likely to respond favorably to the drug and exclude those likely to experience side effects. In that way, 
the effectiveness of drug treatment may be measured in individuals who respond positively to the drug, 
without lowering the measurement as a result of the inclusion of individuals who are unlikely to respond 
positively in the study and without risking undesirable safety problems. 

[0102] Thus, another embodiment is a method of selecting an individual for inclusion in a clinical 
trial of a treatment or drug comprising the steps of: (a) obtaining a nucleic acid sample from an 
individual; (b) determining the identity of a polymorphic variation which is associated with a positive 
response to the treatment or the drug, or at least one polymorphic variation which is associated with a 
negative response to the treatment or the drug in the nucleic acid sample, and (c) including the individual 
in the clinical trial if the nucleic acid sample contains said polymorphic variation associated with a 
positive response to the treatment or the drug or if the nucleic acid sample lacks said polymorphic 
variation associated with a negative response to the treatment or the drug. In addition, the methods 
described herein for selecting an individual for inclusion in a clinical trial of a treatment or drug 
encompass methods with any further limitation described in this disclosure, or those following, specified 
alone or in any combination. The polymorphic variation may be in a sequence selected individually or in 
any combination from the group consisting of (i) a nucleotide sequence of SEQ ID NO: 1-7; (ii) a 
nucleotide sequence which encodes a polypeptide consisting of an amino acid sequence encoded by a 
nucleotide sequence of SEQ ID NO: 1-7; (iii) a nucleotide sequence which encodes a polypeptide that is 
90% or more identical to an amino acid sequence encoded by a nucleotide sequence of SEQ ID NO: 1 -7, 
or a nucleotide sequence about 90% or more identical to a nucleotide sequence of SEQ ID NO: 1-7; and 
(iv) a fragment of a polynucleotide sequence of (i), (ii), or (iii) comprising the polymorphic site. The 
including step (c) optionally comprises administering the drug or the treatment to the individual if the 
nucleic acid sample contains the polymorphic variation associated with a positive response to the 
treatment or the drug and the nucleic acid sample lacks said biallelic marker associated with a negative 
response to the treatment or the drug. 

[0103] Also provided herein is a method of partnering between a diagnostic/prognostic testing 
provider and a provider of a consumable product, which comprises: (a) the diagnostic/prognostic testing 
provider detects the presence or absence of a polymorphic variation associated with osteoarthritis at a 
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polymorphic site in a nucleotide sequence in a nucleic acid sample from a subject; (b) the 
diagnostic/prognostic testing provider identifies the subpopulation of subjects in which the polymorphic 
variation is associated with osteoarthritis; (c) the diagnostic/prognostic testing provider forwards 
information to the subpopulation of subjects about a particular product which may be obtained and 
consumed or applied by the subject to help prevent or delay onset of the disease or condition; and (d) the 
provider of a consumable product forwards to the diagnostic test provider a fee every time the 
diagnostic/prognostic test provider forwards information to the subject as set forth in step (c) above. 

Compositions Comprising Qsteoarthritis-Directed Molecules 

[0104] Featured herein is a composition comprising a cell from a subject having osteoarthritis or at 
risk of osteoarthritis and one or more molecules specifically directed and targeted to a nucleic acid 
comprising a APOL3 nucleotide sequence or amino acid sequence. Such directed molecules include, but 
are not limited to, a compound that binds to a APOL3 nucleotide sequence or amino acid sequence 
referenced herein; a RNAi or siRNA molecule having a strand complementary or substantially 
complementary to a APOL3 nucleotide sequence (e.g., hybridizes to a APOL3 nucleotide sequence under 
conditions of high stringency); an antisense nucleic acid complementary or substantially complementary 
to an RNA encoded by a APOL3 nucleotide sequence (e.g., hybridizes to a APOL3 nucleotide sequence 
under conditions of high stringency); a ribozyme that hybridizes to a APOL3 nucleotide sequence (e.g., 
hybridizes to a APOL3 nucleotide sequence under conditions of high stringency); a nucleic acid aptamer 
that specifically binds a polypeptide encoded by APOL3 nucleotide sequence; and an antibody that 
specifically binds to a polypeptide encoded by APOL3 nucleotide sequence or binds to a nucleic acid 
having such a nucleotide sequence. In specific embodiments, the osteoarthritis directed molecule 
interacts with a nucleic acid or polypeptide variant associated with osteoarthritis, such as variants 
referenced herein. In other embodiments, the osteoarthritis directed molecule interacts with a 
polypeptide involved in a signal pathway of a polypeptide encoded by a APOL3 nucleotide sequence, or a 
nucleic acid comprising such a nucleotide sequence. 

[0105] Compositions sometimes include an adjuvant known to stimulate an immune response, and 
in certain embodiments, an adjuvant that stimulates a T-cell lymphocyte response. Adjuvants are known, 
including but not limited to an aluminum adjuvant (e.g., aluminum hydroxide); a cytokine adjuvant or 
adjuvant that stimulates a cytokine response (e.g., interleukin (IL)-12 and/or gamma-interferon 
cytokines); a Freund-type mineral oil adjuvant emulsion (e.g., Freund's complete or incomplete 
adjuvant); a synthetic lipoid compound; a copolymer adjuvant (e.g., TitreMax); a saponin; Quil A; a 
liposome; an oil-in-water emulsion (e.g., an emulsion stabilized by Tween 80 and pluronic 
polyoxyethlene/polyoxypropylene block copolymer (Syntex Adjuvant Formulation); TitreMax; 
detoxified endotoxin (MPL) and mycobacterial cell wall components (TDW, CWS) in 2% squalene (Ribi 
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Adjuvant System)); a muramyl dipeptide; an immune-stimulating complex (ISCOM, e.g., an Ag- 
modified saponin/cholesterol micelle that forms stable cage-like structure); an aqueous phase adjuvant 
that does not have a depot effect (e.g., Gerbu adjuvant); a carbohydrate polymer (e.g., AdjuPrime); L- 
tyrosine; a manide-oleate compound (e.g., Montanide); an ethylene- vinyl acetate copolymer (e.g., Elvax 
40W1 ,2); or lipid A, for example. Such compositions are useful for generating an immune response 
against osteoarthritis directed molecule (e.g., an HLA-binding subsequence within a polypeptide encoded 
by a APOL3 nucleotide sequence). In such methods, a peptide having an amino acid subsequence of a 
polypeptide encoded by a APOL3 nucleotide sequence is delivered to a subject, where the subsequence 
binds to an HLA molecule and induces a CTL lymphocyte response. The peptide sometimes is delivered 
to the subject as an isolated peptide or as a mini gene in a plasmid that encodes the peptide. Methods for 
identifying HLA-binding subsequences in such polypeptides are known (see e.g., publication 
WO02/20616 and PCT application US98/01373 for methods of identifying such sequences). 

[0106] The cell may be in a group of cells cultured in vitro or in a tissue maintained in vitro or 
present in an animal in vivo (e.g., a rat, mouse, ape or human). In certain embodiments, a composition 
comprises a component from a cell such as a nucleic acid molecule (e.g., genomic DNA), a protein 
mixture or isolated protein, for example. The aforementioned compositions have utility in diagnostic, 
prognostic and pharmacogenomic methods described previously and in therapeutics described hereafter. 
Certain osteoarthritis directed molecules are described in greater detail below. 

Compounds 

[0107] Compounds can be obtained using any of the numerous approaches in combinatorial library 
methods known in the art, including: biological libraries; peptoid libraries (libraries of molecules having 
the functionalities of peptides, but with a novel, non-peptide backbone which are resistant to enzymatic 
degradation but which nevertheless remain bioactive (see, e.g., Zuckermann et al., J. Med. Chem.37: 
2678-85 (1994)); spatially addressable parallel solid phase or solution phase libraries; synthetic library 
methods requiring deconvolution; "one-bead one-compound" library methods; and synthetic library 
methods using affinity chromatography selection. Biological library and peptoid library approaches are 
typically limited to peptide libraries, while the other approaches are applicable to peptide, non-peptide 
oligomer or small molecule libraries of compounds (Lam, Anticancer Drug Des. 12: 145, (1997)). 
Examples of methods for synthesizing molecular libraries are described, for example, in DeWitt et al., 
Proc. Natl. Acad. Sci. U.S.A. 90: 6909 (1993); Erb et al., Proc. Natl. Acad. Sci. USA 91 : 1 1422 (1994); 
Zuckermann et al., J. Med. Chem. 37: 2678 (1994); Cho et al., Science 261: 1303 (1993); Carrell et al., 
Angew. Chem. Int. Ed. Engl. 33: 2059 (1994); Carell et al., Angew. Chem. Int. Ed. Engl. 33: 2061 
(1994); and in Gallop et al., J. Med. Chem. 37: 1233 (1994). 
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[0108] Libraries of compounds may be presented in solution (e.g., Houghten, Biotechniques 13: 
412-421 (1992)), or on beads (Lam, Nature 354: 82-84 (1991)), chips (Fodor, Nature 364: 555-556 
(1993)), bacteria or spores (Ladner, United States Patent No. 5,223,409), plasmids (Cull et al., Proc. Natl. 
Acad. Sci. USA 89: 1865-1869 (1992)) or on phage (Scott and Smith, Science 249: 386-390 (1990); 
Devlin, Science 249: 404-406 (1990); Cwirla et al., Proc. Natl. Acad. Sci. 87: 6378-6382 (1990); Felici, 
J. Mol. Biol. 222: 301-310 (1991); Ladner supra.). 

[0109] A compound sometimes alters expression and sometimes alters activity of a polypeptide 
target and may be a small molecule. Small molecules include, but are not limited to, peptides, 
peptidomimetics (e.g., peptoids), amino acids, amino acid analogs, polynucleotides, polynucleotide 
analogs, nucleotides, nucleotide analogs, organic or inorganic compounds (i.e., including heteroorganic 
and organometallic compounds) having a molecular weight less than about 10,000 grams per mole, 
organic or inorganic compounds having a molecular weight less than about 5,000 grams per mole, 
organic or inorganic compounds having a molecular weight less than about 1 ,000 grams per mole, 
organic or inorganic compounds having a molecular weight less than about 500 grams per mole, and 
salts, esters, and other pharmaceutically acceptable forms of such compounds. 

Antisense Nucleic Acid Molecules. Ribozymes. RNAi. siRNA and Modified Nucleic 
Acid Molecules 

[0110] An "antisense" nucleic acid refers to a nucleotide sequence complementary to a "sense" 
nucleic acid encoding a polypeptide, e.g., complementary to the coding strand of a double-stranded 
cDNA molecule or complementary to an mRNA sequence. The antisense nucleic acid can be 
complementary to an entire coding strand, or to a portion thereof or a substantially identical sequence 
thereof. In another embodiment, the antisense nucleic acid molecule is antisense to a "noncoding region" 
of the coding strand of a nucleotide sequence (e.g., 5' and 3' untranslated regions in SEQ ID NO: 1). 

[0111] An antisense nucleic acid can be designed such that it is complementary to the entire coding 
region of an mRNA encoded by a nucleotide sequence (e.g., SEQ ED NO: 1), and often the antisense 
nucleic acid is an oligonucleotide antisense to only a portion of a coding or noncoding region of the 
mRNA. For example, the antisense oligonucleotide can be complementary to the region surrounding the 
translation start site of the mRNA, e.g., between the -10 and +10 regions of the target gene nucleotide 
sequence of interest. An antisense oligonucleotide can be, for example, about 7, 10, 15, 20, 25, 30, 35, 
40, 45, 50, 55, 60, 65, 70, 75, 80, or more nucleotides in length. The antisense nucleic acids, which 
include the ribozymes described hereafter, can be designed to target a APOL3 nucleotide sequence, often 
a variant associated with osteoarthritis, or a substantially identical sequence thereof. Among the variants, 
minor alleles and major alleles can be targeted, and those associated with a higher risk of osteoarthritis 
are often designed, tested, and administered to subjects. 
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[0112] An antisense nucleic acid can be constructed using chemical synthesis and enzymatic ligation 
reactions using standard procedures. For example, an antisense nucleic acid (e.g., an antisense 
oligonucleotide) can be chemically synthesized using naturally occurring nucleotides or variously 
modified nucleotides designed to increase the biological stability of the molecules or to increase the 
physical stability of the duplex formed between the antisense and sense nucleic acids, e.g., 
phosphorothioate derivatives and acridine substituted nucleotides can be used. Antisense nucleic acid 
also can be produced biologically using an expression vector into which a nucleic acid has been 
subcloned in an antisense orientation (i.e., RNA transcribed from the inserted nucleic acid will be of an 
antisense orientation to a target nucleic acid of interest, described further in the following subsection). 

[0113] When utilized as therapeutics, antisense nucleic acids typically are administered to a subject 
(e.g., by direct injection at a tissue site) or generated in situ such that they hybridize with or bind to 
cellular mRNA and/or genomic DNA encoding a polypeptide and thereby inhibit expression of the 
polypeptide, for example, by inhibiting transcription and/or translation. Alternatively, antisense nucleic 
acid molecules can be modified to target selected cells and then are administered systemically. For 
systemic administration, antisense molecules can be modified such that they specifically bind to receptors 
or antigens expressed on a selected cell surface, for example, by linking antisense nucleic acid molecules 
to peptides or antibodies which bind to cell surface receptors or antigens. Antisense nucleic acid 
molecules can also be delivered to cells using the vectors described herein. Sufficient intracellular 
concentrations of antisense molecules are achieved by incorporating a strong promoter, such as a pol II or 
pol III promoter, in the vector construct. 

[0114] Antisense nucleic acid molecules sometimes are alpha-anomeric nucleic acid molecules. An 
alpha-anomeric nucleic acid molecule forms specific double-stranded hybrids with complementary RNA 
in which, contrary to the usual beta-units, the strands run parallel to each other (Gaultier et al., Nucleic 
Acids. Res. 15: 6625-6641 (1987)). Antisense nucleic acid molecules can also comprise a 2'-o- 
methylribonucleotide (Inoue et al., Nucleic Acids Res. 15: 6131-6148 (1987)) or a chimeric RNA-DNA 
analogue (Inoue et al., FEBS Lett. 215: 327-330 (1987)). Antisense nucleic acids sometimes are 
composed of DNA or PNA or any other nucleic acid derivatives described previously. 

[0115] In another embodiment, an antisense nucleic acid is a ribozyme. A ribozyme having 
specificity for a APOL3 nucleotide sequence can include one or more sequences complementary to such a 
nucleotide sequence, and a sequence having a known catalytic region responsible for mRNA cleavage 
(see e.g., U.S. Pat. No. 5,093,246 or Haselhoff and Gerlach, Nature 334: 585-591 (1988)). For example, 
a derivative of a Tetrahymena L-19 IVS RNA is sometimes utilized in which the nucleotide sequence of 
the active site is complementary to the nucleotide sequence to be cleaved in a mRNA (see e.g., Cech et 
al. U.S. Patent No. 4,987,071; and Cech et al. U.S. Patent No. 5,1 16,742). Also, target mRNA sequences 
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can be used to select a catalytic RNA having a specific ribonuclease activity from a pool of RNA 
molecules (see e.g., Bartel & Szostak, Science 261: 1411-1418 (1993)). 

[0116] Osteoarthritis directed molecules include in certain embodiments nucleic acids that can form 
triple helix structures with a APOL3 nucleotide sequence, or a substantially identical sequence thereof, 
especially one that includes a regulatory region that controls expression of a polypeptide. Gene 
expression can be inhibited by targeting nucleotide sequences complementary to the regulatory region of 
a nucleotide sequence referenced herein or a substantially identical sequence (e.g., promoter and/or 
enhancers) to form triple helical structures that prevent transcription of a gene in target cells (see e.g., 
Helene, Anticancer Drug Des. 6(6): 569-84 (1991); Helene et al., Ann. N.Y. Acad. Sci. 660: 27-36 
(1992); and Maher, Bioassays 14(12): 807-15 (1992). Potential sequences that can be targeted for triple 
helix formation can be increased by creating a so-called "switchback" nucleic acid molecule. Switchback 
molecules are synthesized in an alternating 5'-3', 3'-5' manner, such that they base pair with first one 
strand of a duplex and then the other, eliminating the necessity for a sizeable stretch of either purines or 
pyrimidines to be present on one strand of a duplex. 

[0117] Osteoarthritis directed molecules include RNAi and siRNA nucleic acids. Gene expression 
may be inhibited by the introduction of double-stranded RNA (dsRNA), which induces potent and 
specific gene silencing, a phenomenon called RNA interference or RNAi. See, e.g., Fire et al., US Patent 
Number 6,506,559; Tuschl et al. PCT International Publication No. WO 01/75164; Kay et al. PCT 
International Publication No. WO 03/0 101 80A1; or Bosher JM, Labouesse, Nat Cell Biol 2000 
Feb;2(2):E31-6. This process has been improved by decreasing the size of the double-stranded RNA to 
20-24 base pairs (to create small-interfering RNAs or siRNAs) that "switched off genes in mammalian 
cells without initiating an acute phase response, i.e., a host defense mechanism that often results in cell 
death (see, e.g., Caplen et al. Proc Natl Acad Sci USA. 2001 Aug 14;98(17):9742-7 and Elbashir et al. 
Methods 2002 Feb;26(2): 199-213). There is increasing evidence of post-transcriptional gene silencing 
by RNA interference (RNAi) for inhibiting targeted expression in mammalian cells at the mRNA level, 
in human cells. There is additional evidence of effective methods for inhibiting the proliferation and 
migration of tumor cells in human patients, and for inhibiting metastatic cancer development (see, e.g., 
U.S. Patent Application No. US2001 000993 1 83; Caplen et al. Proc Natl Acad Sci USA; and 
Abderrahmani et al. Mol Cell Biol 2001 Nov21(21):7256-67). 

[01 18] An "siRNA" or "RNAi" refers to a nucleic acid that forms a double stranded RNA and has 
the ability to reduce or inhibit expression of a gene or target gene when the siRNA is delivered to or 
expressed in the same cell as the gene or target gene. "siRNA" refers to short double-stranded RNA 
formed by the complementary strands. Complementary portions of the siRNA that hybridize to form the 
double stranded molecule often have substantial or complete identity to the target molecule sequence. In 
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one embodiment, an siRNA refers to a nucleic acid that has substantial or complete identity to a target 
gene and forms a double stranded siRNA. 

[0119] When designing the siRNA molecules, the targeted region often is selected from a given 
DNA sequence beginning 50 to 100 nucleotides downstream of the start codon. See, e.g., Elbashir et al,. 
Methods 26: 199-213 (2002). Initially, 5' or 3' UTRs and regions nearby the start codon were avoided 
assuming that UTR-binding proteins and/or translation initiation complexes may interfere with binding of 
the siRNP or RISC endonuclease complex. Sometimes regions of the target 23 nucleotides in length 
conforming to the sequence motif AA(N19)TT (N, an nucleotide), and regions with approximately 30% 
to 70% G/C-content (often about 50% G/C-content) often are selected. If no suitable sequences are 
found, the search often is extended using the motif NA(N21). The sequence of the sense siRNA 
sometimes corresponds to (N19) TT or N21 (position 3 to 23 of the 23-nt motif), respectively. In the 
latter case, the 3' end of the sense siRNA often is converted to TT. The rationale for this sequence 
conversion is to generate a symmetric duplex with respect to the sequence composition of the sense and 
antisense 3' overhangs. The antisense siRNA is synthesized as the complement to position 1 to 21 of the 
23-nt motif. Because position 1 of the 23-nt motif is not recognized sequence-specifically by the 
antisense siRNA, the 3' -most nucleotide residue of the antisense siRNA can be chosen deliberately. 
However, the penultimate nucleotide of the antisense siRNA (complementary to position 2 of the 23-nt 
motif) often is complementary to the targeted sequence. For simplifying chemical synthesis, TT often is 
utilized. siRNAs corresponding to the target motif NAR(N 17) YNN, where R is purine (A,G) and Y is 
pyrimidine (C,U), often are selected. Respective 21 nucleotide sense and antisense siRNAs often begin 
with a purine nucleotide and can also be expressed from pol III expression vectors without a change in 
targeting site. Expression of RNAs from pol III promoters often is efficient when the first transcribed 
nucleotide is a purine. 

[0120] The sequence of the siRNA can correspond to the full length target gene, or a subsequence 
thereof. Often, the siRNA is about 15 to about 50 nucleotides in length (e.g., each complementary 
sequence of the double stranded siRNA is 15-50 nucleotides in length, and the double stranded siRNA is 
about 15-50 base pairs in length, sometimes about 20-30 nucleotides in length or about 20-25 nucleotides 
in length, e.g., 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, or 30 nucleotides in length. The siRNA sometimes 
is about 21 nucleotides in length. Methods of using siRNA are well known in the art, and specific siRNA 
molecules may be purchased from a number of companies including Dharmacon Research, Inc. 

[0121] Antisense, ribozyme, RNAi and siRNA nucleic acids can be altered to form modified nucleic 
acid molecules. The nucleic acids can be altered at base moieties, sugar moieties or phosphate backbone 
moieties to improve stability, hybridization, or solubility of the molecule. For example, the deoxyribose 
phosphate backbone of nucleic acid molecules can be modified to generate peptide nucleic acids (see 
Hyrup et al., Bioorganic & Medicinal Chemistry 4 (1): 5-23 (1996)). As used herein, the terms "peptide 
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nucleic acid" or "PNA" refers to a nucleic acid mimic such as a DNA mimic, in which the deoxyribose 
phosphate backbone is replaced by a pseudopeptide backbone and only the four natural nucleobases are 
retained. The neutral backbone of a PNA can allow for specific hybridization to DNA and RNA under 
conditions of low ionic strength. Synthesis of PNA oligomers can be performed using standard solid 
phase peptide synthesis protocols as described, for example, in Hyrup et al., (1996) supra and Perry- 
O'Keefe et al., Proc. Natl. Acad. Sci. 93: 14670-675 (1996). 

[0122] PNA nucleic acids can be used in prognostic, diagnostic, and therapeutic applications. For 
example, PNAs can be used as antisense or antigene agents for sequence-specific modulation of gene 
expression by, for example, inducing transcription or translation arrest or inhibiting replication. PNA 
nucleic acid molecules can also be used in the analysis of single base pair mutations in a gene, (e.g., by 
PNA-directed PCR clamping); as "artificial restriction enzymes" when used in combination with other 
enzymes, (e.g., SI nucleases (Hyrup (1996) supra)); or as probes or primers for DNA sequencing or 
hybridization (Hyrup et al., (1996) supra; Perry-O'Keefe supra). 

[0123] In other embodiments, oligonucleotides may include other appended groups such as peptides 
(e.g., for targeting host cell receptors in vivo), or agents facilitating transport across cell membranes (see 
e.g., Letsinger et al., Proc. Natl. Acad. Sci. USA 86: 6553-6556 (1989); Lemaitre et al., Proc. Natl. Acad. 
Sci. USA 84: 648-652 (1987); PCT Publication No. W088/09810) or the blood-brain barrier (see, e.g., 
PCT Publication No. W089/10134). In addition, oligonucleotides can be modified with hybridization- 
triggered cleavage agents (See, e.g., Krol et al., Bio-Techniques 6: 958-976 (1988)) or intercalating 
agents. (See, e.g., Zon, Pharm. Res. 5: 539-549 (1988) ). To this end, the oligonucleotide may be 
conjugated to another molecule, (e.g., a peptide, hybridization triggered cross-linking agent, transport 
agent, or hybridization-triggered cleavage agent). 

[0124] Also included herein are molecular beacon oligonucleotide primer and probe molecules 
having one or more regions complementary to a APOL3 nucleotide sequence, or a substantially identical 
sequence thereof, two complementary regions one having a fluorophore and one a quencher such that the 
molecular beacon is useful for quantifying the presence of the nucleic acid in a sample. Molecular 
beacon nucleic acids are described, for example, in Lizardi et al., U.S. Patent No. 5,854,033; Nazarenko 
et al., U.S. Patent No. 5,866,336, and Livak et al., U.S. Patent 5,876,930. 

Antibodies 

[0125] The term "antibody" as used herein refers to an immunoglobulin molecule or 
immunologically active portion thereof, i.e., an antigen-binding portion. Examples of immunologically 
active portions of immunoglobulin molecules include F(ab) and F(ab') 2 fragments which can be 
generated by treating the antibody with an enzyme such as pepsin. An antibody sometimes is a 
polyclonal, monoclonal, recombinant (e.g., a chimeric or humanized), fully human, non-human (e.g., 
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murine), or a single chain antibody. An antibody may have effector function and can fix complement, 
and is sometimes coupled to a toxin or imaging agent. 

[0126] A full-length polypeptide or antigenic peptide fragment encoded by a nucleotide sequence 
referenced herein can be used as an immunogen or can be used to identify antibodies made with other 
immunogens, e.g., cells, membrane preparations, and the like. An antigenic peptide often includes at 
least 8 amino acid residues of the amino acid sequences encoded by a nucleotide sequence referenced 
herein, or substantially identical sequence thereof, and encompasses an epitope. Antigenic peptides 
sometimes include 10 or more amino acids, 15 or more amino acids, 20 or more amino acids, or 30 or 
more amino acids. Hydrophilic and hydrophobic fragments of polypeptides sometimes are used as 
immunogens. 

[0127] Epitopes encompassed by the antigenic peptide are regions located on the surface of the 
polypeptide (e.g., hydrophilic regions) as well as regions with high antigenicity. For example, an Emini 
surface probability analysis of the human polypeptide sequence can be used to indicate the regions that 
have a particularly high probability of being localized to the surface of the polypeptide and are thus likely 
to constitute surface residues useful for targeting antibody production. The antibody may bind an epitope 
on any domain or region on polypeptides described herein. 

[0128] Also, chimeric, humanized, and completely human antibodies are useful for applications 
which include repeated administration to subjects. Chimeric and humanized monoclonal antibodies, 
comprising both human and non-human portions, can be made using standard recombinant DNA 
techniques. Such chimeric and humanized monoclonal antibodies can be produced by recombinant DNA 
techniques known in the art, for example using methods described in Robinson et al International 
Application No. PCT/US86/02269; Akira, et al European Patent Application 1 84,187; Taniguchi, M., 
European Patent Application 171,496; Morrison et al European Patent Application 173,494; Neuberger et 
al PCT International Publication No. WO 86/01533; Cabilly et al U.S. Patent No. 4,816,567; Cabilly et al 
European Patent Application 125,023; Better et al., Science 240: 1041-1043 (1988); Liu et al., Proc. Natl. 
Acad. Sci. USA 84: 3439-3443 (1987); Liu et al., J. Immunol. 139: 3521-3526 (1987); Sun et al., Proc. 
Natl. Acad. Sci. USA 84: 214-218 (1987); Nishimura et al., Cane. Res. 47: 999-1005 (1987); Wood et al., 
Nature 314: 446-449 (1985); and Shaw et al., J. Natl. Cancer Inst. 80: 1553-1559 (1988); Morrison, S. 
L., Science 229: 1202-1207 (1985); Oi et al., BioTechniques 4: 214 (1986); Winter U.S. Patent 
5,225,539; Jones et al., Nature 321: 552-525 (1986); Verhoeyan et al., Science 239: 1534; and Beidler et 
al., J. Immunol. 141: 4053-4060 (1988). 

[0129] Completely human antibodies are particularly desirable for therapeutic treatment of human 
patients. Such antibodies can be produced using transgenic mice that are incapable of expressing 
endogenous immunoglobulin heavy and light chains genes, but which can express human heavy and light 
chain genes. See, for example, Lonberg and Huszar, Int. Rev. Immunol. 13: 65-93 (1995); and U.S. 
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Patent Nos. 5,625,126; 5,633,425; 5,569,825; 5,661,016; and 5,545,806. In addition, companies such as 
Abgenix, Inc. (Fremont, CA) and Medarex, Inc. (Princeton, NJ), can be engaged to provide human 
antibodies directed against a selected antigen using technology similar to that described above. 
Completely human antibodies that recognize a selected epitope also can be generated using a technique 
referred to as "guided selection." In this approach a selected non-human monoclonal antibody (e.g., a 
murine antibody) is used to guide the selection of a completely human antibody recognizing the same 
epitope. This technology is described for example by Jespers et al., Bio/Technology 12: 899-903 (1994). 

[0130] An antibody can be a single chain antibody. A single chain antibody (scFV) can be 
engineered (see, e.g., Colcher et al., Ann. N Y Acad. Sci. 880: 263-80 (1999); and Reiter, Clin. Cancer 
Res. 2: 245-52 (1996)). Single chain antibodies can be dimerized or multimerized to generate 
multivalent antibodies having specificities for different epitopes of the same target polypeptide. 

[0131] Antibodies also may be selected or modified so that they exhibit reduced or no ability to bind 
an Fc receptor. For example, an antibody may be an isotype or subtype, fragment or other mutant, which 
does not support binding to an Fc receptor (e.g., it has a mutagenized or deleted Fc receptor binding 
region). 

[0132] Also, an antibody (or fragment thereof) may be conjugated to a therapeutic moiety such as a 
cytotoxin, a therapeutic agent or a radioactive metal ion. A cytotoxin or cytotoxic agent includes any 
agent that is detrimental to cells. Examples include taxol, cytochalasin B, gramicidin D, ethidium 
bromide, emetine, mitomycin, etoposide, tenoposide, vincristine, vinblastine, colchicin, doxorubicin, 
daunorubicin, dihydroxy anthracin dione, mitoxantrone, mithramycin, actinomycin D, 1 
dehydrotestosterone, glucocorticoids, procaine, tetracaine, lidocaine, propranolol, and puromycin and 
analogs or homo logs thereof. Therapeutic agents include, but are not limited to, antimetabolites (e.g., 
methotrexate, 6-mercaptopurine, 6-thioguanine, cytarabine, 5-fluorouracil decarbazine), alkylating agents 
(e.g., mechlorethamine, thiotepa chlorambucil, melphalan, carmustine (BCNU) and lomustine (CCNU), 
cyclophosphamide, busulfan, dibromomannitol, streptozotocin, mitomycin C, and cis-dichlorodiamine 
platinum (II) (DDP) cisplatin), anthracyclines (e.g., daunorubicin (formerly daunomycin) and 
doxorubicin), antibiotics (e.g., dactinomycin (formerly actinomycin), bleomycin, mithramycin, and 
anthramycin (AMC)), and anti-mitotic agents (e.g., vincristine and vinblastine). 

[0133] Antibody conjugates can be used for modifying a given biological response. For example, 
the drug moiety may be a protein or polypeptide possessing a desired biological activity. Such proteins 
may include, for example, a toxin such as abrin, ricin A, pseudomonas exotoxin, or diphtheria toxin; a 
polypeptide such as tumor necrosis factor, gamma-interferon, alpha-interferon, nerve growth factor, 
platelet derived growth factor, tissue plasminogen activator; or, biological response modifiers such as, for 
example, lymphokines, interleukin-1 ("IL-1"), interleukin-2 ("IL-2"), interleukin-6 ("IL-6"), granulocyte 
macrophage colony stimulating factor ("GM-CSF"), granulocyte colony stimulating factor ("G-CSF"), or 
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other growth factors. Also, an antibody can be conjugated to a second antibody to form an antibody 
heteroconjugate as described by Segal in U.S. Patent No. 4,676,980, for example. 

[0134] An antibody (e.g., monoclonal antibody) can be used to isolate target polypeptides by 
standard techniques, such as affinity chromatography or immunoprecipitation. Moreover, an antibody 
can be used to detect a target polypeptide (e.g., in a cellular lysate or cell supernatant) in order to evaluate 
the abundance and pattern of expression of the polypeptide. Antibodies can be used diagnostically to 
monitor polypeptide levels in tissue as part of a clinical testing procedure, e.g., to determine the efficacy 
of a given treatment regimen. Detection can be facilitated by coupling (i.e., physically linking) the 
antibody to a detectable substance (i.e., antibody labeling). Examples of detectable substances include 
various en2ymes, prosthetic groups, fluorescent materials, luminescent materials, bioluminescent 
materials, and radioactive materials. Examples of suitable enzymes include horseradish peroxidase, 
alkaline phosphatase, fl-galactosidase, or acetylcholinesterase; examples of suitable prosthetic group 
complexes include streptavidin/biotin and avidin/biotin; examples of suitable fluorescent materials 
include umbelliferone, fluorescein, fluorescein isothiocyanate, rhodamine, dichlorotriazinylamine 
fluorescein, dansyl chloride or phycoerythrin; an example of a luminescent material includes luminol; 
examples of bioluminescent materials include luciferase, luciferin, and aequorin, and examples of 
suitable radioactive material include 125 I, 13 'i, 35 S or 3 H. Also, an antibody can be utilized as a test 
molecule for determining whether it can treat osteoarthritis, and as a therapeutic for administration to a 
subject for treating osteoarthritis. 

[0135] An antibody can be made by immunizing with a purified antigen, or a fragment thereof, e.g., 
a fragment described herein, a membrane associated antigen, tissues, e.g., crude tissue preparations, 
whole cells, preferably living cells, lysed cells, or cell fractions. 

[0136] Included herein are antibodies which bind only a native polypeptide, only denatured or 
otherwise non-native polypeptide, or which bind both, as well as those having linear or conformational 
epitopes. Conformational epitopes sometimes can be identified by selecting antibodies that bind to 
native but not denatured polypeptide. Also featured are antibodies that specifically bind to a polypeptide 
variant associated with osteoarthritis. 

Methods for Identifying Candidate Therapeutics for Treating Osteoarthritis 

[0137] Current therapies for the treatment of osteoarthritis have limited efficacy, limited tolerability 
and significant mechanism-based side effects, and few of the available therapies adequately address 
underlying defects. Current therapeutic approaches were largely developed in the absence of defined 
molecular targets or even a solid understanding of disease pathogenesis. Therefore, provided are 
methods of identifying candidate therapeutics that target biochemical pathways related to the 
development of osteoarthritis. 
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[0138] Thus, featured herein are methods for identifying a candidate therapeutic for treating 
osteoarthritis. The methods comprise contacting a test molecule with a target molecule in a system. A 
"target molecule" as used herein refers to a APOL3 nucleic acid, a substantially identical nucleic acid 
thereof, or a fragment thereof, and an encoded polypeptide of the foregoing. The methods also comprise 
determining the presence or absence of an interaction between the test molecule and the target molecule, 
where the presence of an interaction between the test molecule and the nucleic acid or polypeptide 
identifies the test molecule as a candidate osteoarthritis therapeutic. The interaction between the test 
molecule and the target molecule may be quantified. 

[0139] Test molecules and candidate therapeutics include, but are not limited to, compounds, 
antisense nucleic acids, siRNA molecules, ribozymes, polypeptides or proteins encoded by a APOL3 
nucleotide sequence, or a substantially identical sequence or fragment thereof, and immunotherapeutics 
(e.g., antibodies and HLA-presented polypeptide fragments). A test molecule or candidate therapeutic 
may act as a modulator of target molecule concentration or target molecule function in a system. A 
"modulator" may agonize (i.e., up-regulates) or antagonize (i.e., down-regulates) a target molecule 
concentration partially or completely in a system by affecting such cellular functions as DNA replication 
and/or DNA processing (e.g., DNA methylation or DNA repair), RNA transcription and/or RNA 
processing (e.g., removal of intronic sequences and/or translocation of spliced mRNA from the nucleus), 
polypeptide production (e.g., translation of the polypeptide from mRNA), and/or polypeptide post- 
translational modification (e.g., glycosylation, phosphorylation, and proteolysis of pro-polypeptides). A 
modulator may also agonize or antagonize a biological function of a target molecule partially or 
completely, where the function may include adopting a certain structural conformation, interacting with 
one or more binding partners, ligand binding, catalysis (e.g., phosphorylation, dephosphorylation, 
hydrolysis, methylation, and isomerization), and an effect upon a cellular event {e.g., effecting 
progression of osteoarthritis). 

[0140] As used herein, the term "system" refers to a cell free in vitro environment and a cell-based 
environment such as a collection of cells, a tissue, an organ, or an organism. A system is "contacted" 
with a test molecule in a variety of manners, including adding molecules in solution and allowing them to 
interact with one another by diffusion, cell injection, and any administration routes in an animal. As used 
herein, the term "interaction" refers to an effect of a test molecule on test molecule, where the effect 
sometimes is binding between the test molecule and the target molecule, and sometimes is an observable 
change in cells, tissue, or organism. 

[0141] There are many standard methods for detecting the presence or absence of interaction 
between a test molecule and a target molecule. For example, titrametric, acidimetric, radiometric, NMR, 
monolayer, polarographic, spectrophotometric, fluorescent, and ESR assays probative of a target 
molecule interaction may be utilized. 
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[0142] Test molecule/target molecule interactions can be detected and/or quantified using assays 
known in the art. For example, an interaction can be determined by labeling the test molecule and/or the 
target molecule, where the label is covalently or non-covalently attached to the test molecule or target 
molecule. The label is sometimes a radioactive molecule such as l25 I, 13I I, 35 S or 3 H, which can be 
detected by direct counting of radioemission or by scintillation counting. Also, enzymatic labels such as 
horseradish peroxidase, alkaline phosphatase, or luciferase may be utilized where the enzymatic label can 
be detected by determining conversion of an appropriate substrate to product. In addition, presence or 
absence of an interaction can be determined without labeling. For example, a microphysiometer {e.g., 
Cytosensor) is an analytical instrument that measures the rate at which a cell acidifies its environment 
using a light-addressable potentiometric sensor (LAPS). Changes in this acidification rate can be used as 
an indication of an interaction between a test molecule and target molecule (McConnell, H. M. et al.. 
Science 257: 1906-1912(1992)). 

[0143] In cell-based systems, cells typically include a APOL3 nucleic acid, an encoded polypeptide, 
or substantially identical nucleic acid or polypeptide thereof, and are often of mammalian origin, 
although the cell can be of any origin. Whole cells, cell homogenates, and cell fractions {e.g., cell 
membrane fractions) can be subjected to analysis. Where interactions between a test molecule with a 
target polypeptide are monitored, soluble and/or membrane bound forms of the polypeptide may be 
utilized. Where membrane-bound forms of the polypeptide are used, it may be desirable to utilize a 
solubilizing agent. Examples of such solubilizing agents include non-ionic detergents such as n- 
octylglucoside, n-dodecylglucoside, n-dodecylmaltoside, octanoyl-N-methylglucamide, decanoyl-N- 

methylglucamide, Triton® X-100, Triton® X-l 14, Thesit®, Isotridecypoly(ethylene glycol ether) n , 3- 

[(3 -cholamidopropyl)dimethylamminio]-l -propane sulfonate (CHAPS), 3-[(3- 

cholamidopropyl)dimethylarnminio]-2-hydroxy-l -propane sulfonate (CHAPSO), or N-dodecyl-N,N- 
dimethyl-3-ammonio-l -propane sulfonate. 

[0144] An interaction between a test molecule and target molecule also can be detected by 
monitoring fluorescence energy transfer (FET) {see, e.g., Lakowicz et al., U.S. Patent No. 5,631,169; 
Stavrianopoulos et al. U.S. Patent No. 4,868,103). A fluorophore label on a first, "donor" molecule is 
selected such that its emitted fluorescent energy will be absorbed by a fluorescent label on a second, 
"acceptor" molecule, which in turn is able to fluoresce due to the absorbed energy. Alternately, the 
"donor" polypeptide molecule may simply utilize the natural fluorescent energy of tryptophan residues. 
Labels are chosen that emit different wavelengths of light, such that the "acceptor" molecule label may 
be differentiated from that of the "donor". Since the efficiency of energy transfer between the labels is 
related to the distance separating the molecules, the spatial relationship between the molecules can be 
assessed. In a situation in which binding occurs between the molecules, the fluorescent emission of the 
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"acceptor" molecule label in the assay should be maximal. An FET binding event can be conveniently 
measured through standard fluorometric detection means well known in the art (e.g. , using a fluorimeter). 

[0145] In another embodiment, determining the presence or absence of an interaction between a test 
molecule and a target molecule can be effected by monitoring surface plasmon resonance (see, e.g. , 
Sjolander& Urbaniczk, Anal. Chem. 63: 2338-2345 (1991) and Szabo et al., Curr. Opin. Struct. Biol. 5: 
699-705 (1995)). "Surface plasmon resonance" or "biomolecular interaction analysis (BIA)" can be 
utilized to detect biospecific interactions in real time, without labeling any of the interactants (e.g. , 
BIAcore). Changes in the mass at the binding surface (indicative of a binding event) result in alterations 
of the refractive index of light near the surface (the optical phenomenon of surface plasmon resonance 
(SPR)), resulting in a detectable signal which can be used as an indication of real-time reactions between 
biological molecules. 

[0146] In another embodiment, the target molecule or test molecules are anchored to a solid phase, 
facilitating the detection of target molecule/test molecule complexes and separation of the complexes 
from free, uncomplexed molecules. The target molecule or test molecule is immobilized to the solid 
support. In an embodiment, the target molecule is anchored to a solid surface, and the test molecule, 
which is not anchored, can be labeled, either directly or indirectly, with detectable labels discussed 
herein. 

[0147] It may be desirable to immobilize a target molecule, an anti-target molecule antibody, and/or 
test molecules to facilitate separation of target molecule/test molecule complexes from uncomplexed 
forms, as well as to accommodate automation of the assay. The attachment between a test molecule 
and/or target molecule and the solid support may be covalent or non-covalent (see, e.g., U.S. Patent No. 
6,022,688 for non-covalent attachments). The solid support may be one or more surfaces of the system, 
such as one or more surfaces in each well of a microtiter plate, a surface of a silicon wafer, a surface of a 
bead (see, e.g., Lam, Nature 354: 82-84 (1991)) that is optionally linked to another solid support, or a 
channel in a microfluidic device, for example. Types of solid supports, linker molecules for covalent and 
non-covalent attachments to solid supports, and methods for immobilizing nucleic acids and other 
molecules to solid supports are well known (see, e.g., U.S. Patent Nos. 6,261,776; 5,900,481; 6,133,436; 
and 6,022,688; and WIPO publication WO 01/18234). 

[0148] In an embodiment, target molecule may be immobilized to surfaces via biotin and 
streptavidin. For example, biotinylated target polypeptide can be prepared from biotin-NHS (N-hydroxy- 
succinimide) using techniques known in the art (e.g., biotinylation kit, Pierce Chemicals, Rockford, IL), 
and immobilized in the wells of streptavidin-coated 96 well plates (Pierce Chemical). In another 
embodiment, a target polypeptide can be prepared as a fusion polypeptide. For example, glutathione-S- 
transferase/target polypeptide fusion can be adsorbed onto glutathione sepharose beads (Sigma Chemical, 
St. Louis, MO) or glutathione derivitized microtiter plates, which are then combined with a test molecule 
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under conditions conducive to complex formation (e.g., at physiological conditions for salt and pH). 
Following incubation, the beads or microtiter plate wells are washed to remove any unbound 
components, or the matrix isimmobilized in the case of beads, and complex formation is determined 
directly or indirectly as described above. Alternatively, the complexes can be dissociated from the 
matrix, and the level of target molecule binding or activity is determined using standard techniques. 

[0149] In an embodiment, the non-immobilized component is added to the coated surface containing 
the anchored component. After the reaction is complete, unreacted components are removed (e.g., by 
washing) under conditions such that a significant percentage of complexes formed will remain 
immobilized to the solid surface. The detection of complexes anchored on the solid surface can be 
accomplished in a number of manners. Where the previously non-immobilized component is pre-labeled, 
the detection of label immobilized on the surface indicates that complexes were formed. Where the 
previously non-immobilized component is not pre-labeled, an indirect label can be used to detect 
complexes anchored on the surface, e.g. , by adding a labeled antibody specific for the immobilized 
component, where the antibody, in turn, can be directly labeled or indirectly labeled with, e.g., a labeled 
anti-Ig antibody. 

[0150] In another embodiment, an assay is performed utilizing antibodies that specifically bind 
target molecule or test molecule but do not interfere with binding of the target molecule to the test 
molecule. Such antibodies can be derivitized to a solid support, and unbound target molecule may be 
immobilized by antibody conjugation. Methods for detecting such complexes, in addition to those 
described above for the GST-immobilized complexes, include immunodetection of complexes using 
antibodies reactive with the target molecule, as well as enzyme-linked assays which rely on detecting an 
enzymatic activity associated with the target molecule. 

[0151] Cell free assays also can be conducted in a liquid phase. In such an assay, reaction products 
are separated from unreacted components, by any of a number of standard techniques, including but not 
limited to: differential centrifugation (see, e.g., Rivas, G., and Minton, Trends Biochem Sci Aug; 18(8): 
284-7 (1993)); chromatography (gel filtration chromatography, ion-exchange chromatography); 
electrophoresis (see, e.g., Ausubel et al., eds. Current Protocols in Molecular Biology , J. Wiley: New 
York (1999)); and immunoprecipitation (see, e.g., Ausubel et al., eds., supra). Media and 
chromatographic techniques are known to one skilled in the art (see, e.g., Heegaard, JMol. Recognit. 
Winter; 11(1-6): 141-8 (1998); Hage & Tweed, J. Chromatogr. B Biomed. Sci. Appl. Oct 10; 699 (1-2): 
499-525 (1997)). Further, fluorescence energy transfer may also be conveniently utilized, as described 
herein, to detect binding without further purification of the complex from solution. 

[0152] In another embodiment, modulators of target molecule expression are identified. For 
example, a cell or cell free mixture is contacted with a candidate compound and the expression of target 
mRNA or target polypeptide is evaluated relative to the level of expression of target mRNA or target 
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polypeptide in the absence of the candidate compound. When expression of target mRNA or target 
polypeptide is greater in the presence of the candidate compound than in its absence, the candidate 
compound is identified as an agonist of target mRNA or target polypeptide expression. Alternatively, 
when expression of target mRNA or target polypeptide is less (e.g. , less with statistical significance) in 
the presence of the candidate compound than in its absence, the candidate compound is identified as an 
antagonist or inhibitor of target mRNA or target polypeptide expression. The level of target mRNA or 
target polypeptide expression can be determined by methods described herein. 

[0153] In another embodiment, binding partners that interact with a target molecule are detected. 
The target molecules can interact with one or more cellular or extracellular macromolecules, such as 
polypeptides in vivo, and these interacting molecules are referred to herein as "binding partners." 
Binding partners can agonize or antagonize target molecule biological activity. Also, test molecules that 
agonize or antagonize interactions between target molecules and binding partners can be useful as 
therapeutic molecules as they can up-regulate or down-regulated target molecule activity in vivo and 
thereby treat osteoarthritis. 

[0154] Binding partners of target molecules can be identified by methods known in the art. For 
example, binding partners may be identified by lysing cells and analyzing cell lysates by electrophoretic 
techniques. Alternatively, a two-hybrid assay or three-hybrid assay can be utilized (see, e.g., U.S. Patent 
No. 5,283,317; Zervos etal., Cell 72:223-232 (1993); Madura etal., J. Biol. Chem. 268: 12046-12054 
(1993); Bartel etal., Biotechniques 14: 920-924 (1993); Iwabuchi et al, Oncogene 8: 1693-1696 (1993); 
and Brent WO94/10300). A two-hybrid system is based on the modular nature of most transcription 
factors, which consist of separable DNA-binding and activation domains. The assay often utilizes two 
different DNA constructs. In one construct, a APOL3 nucleic acid (sometimes referred to as the "bait") is 
fused to a gene encoding the DNA binding domain of a known transcription factor (e.g., GAL-4). In 
another construct, a DNA sequence from a library of DNA sequences that encodes a potential binding 
partner (sometimes referred to as the "prey") is fused to a gene that encodes an activation domain of the 
known transcription factor. Sometimes, a APOL3 nucleic acid can be fused to the activation domain. If 
the "bait" and the "prey" molecules interact in vivo, the DNA-binding and activation domains of the 
transcription factor are brought into close proximity. This proximity allows transcription of a reporter 
gene (e.g. , LacZ) which is operably linked to a transcriptional regulatory site responsive to the 
transcription factor. Expression of the reporter gene can be detected and cell colonies containing the 
functional transcription factor can be isolated and used to identify the potential binding partner. 

[0155] In an embodiment for identifying test molecules that antagonize or agonize complex 
formation between target molecules and binding partners, a reaction mixture containing the target 
molecule and the binding partner is prepared, under conditions and for a time sufficient to allow complex 
formation. The reaction mixture often is provided in the presence or absence of the test molecule. The 
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test molecule can be included initially in the reaction mixture, or can be added at a time subsequent to the 
addition of the target molecule and its binding partner. Control reaction mixtures are incubated without 
the test molecule or with a placebo. Formation of any complexes between the target molecule and the 
binding partner then is detected. Decreased formation of a complex in the reaction mixture containing 
test molecule as compared to in a control reaction mixture indicates that the molecule antagonizes target 
molecule/binding partner complex formation. Alternatively, increased formation of a complex in the 
reaction mixture containing test molecule as compared to in a control reaction mixture indicates that the 
molecule agonizes target molecule/binding partner complex formation. In another embodiment, complex 
formation of target molecule/binding partner can be compared to complex formation of mutant target 
molecule/binding partner (e.g., amino acid modifications in a target polypeptide). Such a comparison can 
be important in those cases where it is desirable to identify test molecules that modulate interactions of 
mutant but not non-mutated target gene products. 

[0156] The assays can be conducted in a heterogeneous or homogeneous format. In heterogeneous 
assays, target molecule and/or the binding partner are immobilized to a solid phase, and complexes are 
detected on the solid phase at the end of the reaction. In homogeneous assays, the entire reaction is 
carried out in a liquid phase. In either approach, the order of addition of reactants can be varied to obtain 
different information about the molecules being tested. For example, test compounds that agonize target 
molecule/binding partner interactions can be identified by conducting the reaction in the presence of the 
test molecule in a competition format. Alternatively, test molecules that agonize preformed complexes, 
e.g., molecules with higher binding constants that displace one of the components from the complex, can 
be tested by adding the test compound to the reaction mixture after complexes have been formed. 

[0157] In a heterogeneous assay embodiment, the target molecule or the binding partner is anchored 
onto a solid surface (e.g., a microtiter plate), while the non-anchored species is labeled, either directly or 
indirectly. The anchored molecule can be immobilized by non-covalent or covalent attachments. 
Alternatively, an immobilized antibody specific for the molecule to be anchored can be used to anchor 
the molecule to the solid surface. The partner of the immobilized species is exposed to the coated surface 
with or without the test molecule. After the reaction is complete, unreacted components are removed 
(e.g., by washing) such that a significant portion of any complexes formed will remain immobilized on 
the solid surface. Where the non-immobilized species is pre-labeled, the detection of label immobilized 
on the surface is indicative of complex. Where the non-immobilized species is not pre-labeled, an 
indirect label can be used to detect complexes anchored to the surface; e.g. , by using a labeled antibody 
specific for the initially non-immobilized species. Depending upon the order of addition of reaction 
components, test compounds that inhibit complex formation or that disrupt preformed complexes can be 
detected. 
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[0158] In another embodiment, the reaction can be conducted in a liquid phase in the presence or 
absence of test molecule, where the reaction products are separated from unreacted components, and the 
complexes are detected (e.g. , using an immobilized antibody specific for one of the binding components 
to anchor any complexes formed in solution, and a labeled antibody specific for the other partner to 
detect anchored complexes). Again, depending upon the order of addition of reactants to the liquid 
phase, test compounds that inhibit complex or that disrupt preformed complexes can be identified. 

[0159] In an alternate embodiment, a homogeneous assay can be utilized. For example, a preformed 
complex of the target gene product and the interactive cellular or extracellular binding partner product is 
prepared. One or both of the target molecule or binding partner is labeled, and the signal generated by 
the label(s) is quenched upon complex formation (e.g., U.S. Patent No. 4,109,496 that utilizes this 
approach for immunoassays). Addition of a test molecule that competes with and displaces one of the 
species from the preformed complex will result in the generation of a signal above background. In this 
way, test substances that disrupt target molecule/binding partner complexes can be identified. 

[0160] Candidate therapeutics for treating osteoarthritis are identified from a group of test molecules 
that interact with a target molecule. Test molecules are normally ranked according to the degree with 
which they modulate (e.g., agonize or antagonize) a function associated with the target molecule (e.g., 
DNA replication and/or processing, RNA transcription and/or processing, polypeptide production and/or 
processing, and/or biological function/activity), and then top ranking modulators are selected. Also, 
pharmacogenomic information described herein can determine the rank of a modulator. The top 10% of 
ranked test molecules often are selected for further testing as candidate therapeutics, and sometimes the 
top 15%, 20%, or 25% of ranked test molecules are selected for further testing as candidate therapeutics. 
Candidate therapeutics typically are formulated for administration to a subject. 

Therapeutic Formulations 

[0161] Formulations and pharmaceutical compositions typically include in combination with a 
pharmaceutically acceptable carrier one or more target molecule modulators. The modulator often is a 
test molecule identified as having an interaction with a target molecule by a screening method described 
above. The modulator may be a compound, an antisense nucleic acid, a ribozyme, an antibody, or a 
binding partner. Also, formulations may comprise a target polypeptide or fragment thereof in 
combination with a pharmaceutically acceptable carrier, where the polypeptide or fragment sometimes 
has an APOL3 biological activity (e.g., apolipoprotein activity), and sometimes includes all or part of an 
apolipoprotein domain. 

[0162] As used herein, the term "pharmaceutically acceptable carrier" includes solvents, dispersion 
media, coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents, and the like, 
compatible with pharmaceutical administration. Supplementary active compounds can also be 
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incorporated into the compositions. Pharmaceutical compositions can be included in a container, pack, or 
dispenser together with instructions for administration. 

[0163] A pharmaceutical composition typically is formulated to be compatible with its intended 
route of administration. Examples of routes of administration include parenteral, e.g., intravenous, 
intradermal, subcutaneous, oral (e.g., inhalation), transdermal (topical), transmucosal, and rectal 
administration. Solutions or suspensions used for parenteral, intradermal, or subcutaneous application 
can include the following components: a sterile diluent such as water for injection, saline solution, fixed 
oils, polyethylene glycols, glycerin, propylene glycol or other synthetic solvents; antibacterial agents 
such as benzyl alcohol or methyl parabens; antioxidants such as ascorbic acid or sodium bisulfite; 
chelating agents such as ethylenediaminetetraacetic acid; buffers such as acetates, citrates or phosphates 
and agents for the adjustment of tonicity such as sodium chloride or dextrose. pH can be adjusted with 
acids or bases, such as hydrochloric acid or sodium hydroxide. The parenteral preparation can be 
enclosed in ampoules, disposable syringes or multiple dose vials made of glass or plastic. 

[0164] Oral compositions generally include an inert diluent or an edible carrier. For the purpose of 
oral therapeutic administration, the active compound can be incorporated with excipients and used in the 
form of tablets, troches, or capsules, e.g., gelatin capsules. Oral compositions can also be prepared using 
a fluid carrier for use as a mouthwash. Pharmaceutically compatible binding agents, and/or adjuvant 
materials can be included as part of the composition. The tablets, pills, capsules, troches and the like can 
contain any of the following ingredients, or compounds of a similar nature: a binder such as 
microcrystalline cellulose, gum tragacanth or gelatin; an excipient such as starch or lactose, a 
disintegrating agent such as alginic acid, Primogel, or corn starch; a lubricant such as magnesium stearate 
or Sterotes; a glidant such as colloidal silicon dioxide; a sweetening agent such as sucrose or saccharin; 
or a flavoring agent such as peppermint, methyl salicylate, or orange flavoring. 

[0165] Pharmaceutical compositions suitable for injectable use include sterile aqueous solutions 
(where water soluble) or dispersions and sterile powders for the extemporaneous preparation of sterile 
injectable solutions or dispersion. For intravenous administration, suitable carriers include physiological 
saline, bacteriostatic water, Cremophor EL™ (BASF, Parsippany, NJ) or phosphate buffered saline 
(PBS). In all cases, the composition must be sterile and should be fluid to the extent that easy 
syringability exists. It should be stable under the conditions of manufacture and storage and must be 
preserved against the contaminating action of microorganisms such as bacteria and fungi. The carrier can 
be a solvent or dispersion medium containing, for example, water, ethanol, polyol (for example, glycerol, 
propylene glycol, and liquid polyethylene glycol, and the like), and suitable mixtures thereof. The proper 
fluidity can be maintained, for example, by the use of a coating such as lecithin, by the maintenance of 
the required particle size in the case of dispersion and by the use of surfactants. Prevention of the action 
of microorganisms can be achieved by various antibacterial and antifungal agents, for example, parabens, 
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chlorobutanol, phenol, ascorbic acid, thimerosal, and the like. In many cases, it will be preferable to 
include isotonic agents, for example, sugars, polyalcohols such as mannitol, sorbitol, sodium chloride in 
the composition. Prolonged absorption of the injectable compositions can be brought about by including 
in the composition an agent which delays absorption, for example, aluminum monostearate and gelatin. 

[0166] Sterile injectable solutions can be prepared by incorporating the active compound in the 
required amount in an appropriate solvent with one or a combination of ingredients enumerated above, as 
required, followed by filtered sterilization. Generally, dispersions are prepared by incorporating the 
active compound into a sterile vehicle which contains a basic dispersion medium and the required other 
ingredients from those enumerated above. In the case of sterile powders for the preparation of sterile 
injectable solutions, the preferred methods of preparation are vacuum drying and freeze-drying which 
yields a powder of the active ingredient plus any additional desired ingredient from a previously sterile- 
filtered solution thereof. 

[0167] For administration by inhalation, the compounds are delivered in the form of an aerosol 
spray from pressured container or dispenser which contains a suitable propellant, e.g., a gas such as 
carbon dioxide, or a nebulizer. 

[0168] Systemic administration can also be by transmucosal or transdermal means. For 
transmucosal or transdermal administration, penetrants appropriate to the barrier to be permeated are 
used in the formulation. Such penetrants are generally known in the art, and include, for example, for 
transmucosal administration, detergents, bile salts, and fusidic acid derivatives. Transmucosal 
administration can be accomplished through the use of nasal sprays or suppositories. For transdermal 
administration, the active compounds are formulated into ointments, salves, gels, or creams as generally 
known in the art. Molecules can also be prepared in the form of suppositories (e.g., with conventional 
suppository bases such as cocoa butter and other glycerides) or retention enemas for rectal delivery. 

[0169] In one embodiment, active molecules are prepared with carriers that will protect the 
compound against rapid elimination from the body, such as a controlled release formulation, including 
implants and microencapsulated delivery systems. Biodegradable, biocompatible polymers can be used, 
such as ethylene vinyl acetate, polyanhydrides, polyglycolic acid, collagen, polyorthoesters, and 
polylactic acid. Methods for preparation of such formulations will be apparent to those skilled in the art. 
Materials can also be obtained commercially from Alza Corporation and Nova Pharmaceuticals, Inc. 
Liposomal suspensions (including liposomes targeted to infected cells with monoclonal antibodies to 
viral antigens) can also be used as pharmaceutically acceptable carriers. These can be prepared 
according to methods known to those skilled in the art, for example, as described in U.S. Patent 
No. 4,522,811. 

[0170] It is advantageous to formulate oral or parenteral compositions in dosage unit form for ease 
of administration and uniformity of dosage. Dosage unit form as used herein refers to physically discrete 
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units suited as unitary dosages for the subject to be treated; each unit containing a predetermined quantity 
of active compound calculated to produce the desired therapeutic effect in association with the required 
pharmaceutical carrier. 

[0171] Toxicity and therapeutic efficacy of such compounds can be determined by standard 
pharmaceutical procedures in cell cultures or experimental animals, e.g., for determining the LD 50 (the 
dose lethal to 50% of the population) and the ED 50 (the dose therapeutically effective in 50% of the 
population). The dose ratio between toxic and therapeutic effects is the therapeutic index and it can be 
expressed as the ratio LD 5 o/ED S o. Molecules which exhibit high therapeutic indices are preferred. While 
molecules that exhibit toxic side effects may be used, care should be taken to design a delivery system 
that targets such compounds to the site of affected tissue in order to rninirnize potential damage to 
uninfected cells and, thereby, reduce side effects. 

[0172] The data obtained from the cell culture assays and animal studies can be used in formulating 
a range of dosage for use in humans. The dosage of such molecules lies preferably within a range of 
circulating concentrations that include the ED 5 o with little or no toxicity. The dosage may vary within 
this range depending upon the dosage form employed and the route of administration utilized. For any 
molecules used in the methods described herein, the therapeutically effective dose can be estimated 
initially from cell culture assays. A dose may be formulated in animal models to achieve a circulating 
plasma concentration range that includes the IC50 (i.e., the concentration of the test compound which 
achieves a half-maximal inhibition of symptoms) as determined in cell culture. Such information can be 
used to more accurately determine useful doses in humans. Levels in plasma may be measured, for 
example, by high performance liquid chromatography. 

[0173] As defined herein, a therapeutically effective amount of protein or polypeptide (i.e., an 
effective dosage) ranges from about 0.001 to 30 mg/kg body weight, sometimes about 0.01 to 25 mg/kg 
body weight, often about 0.1 to 20 mg/kg body weight, and more often about 1 to 10 mg/kg, 2 to 9 
mg/kg, 3 to 8 mg/kg, 4 to 7 mg/kg, or 5 to 6 mg/kg body weight. The protein or polypeptide can be 
administered one time per week for between about 1 to 10 weeks, sometimes between 2 to 8 weeks, often 
between about 3 to 7 weeks, and more often for about 4, 5, or 6 weeks. The skilled artisan will 
appreciate that certain factors may influence the dosage and timing required to effectively treat a subject, 
including but not limited to the severity of the disease or disorder, previous treatments, the general health 
and/or age of the subject, and other diseases present. Moreover, treatment of a subject with a 
therapeutically effective amount of a protein, polypeptide, or antibody can include a single treatment or, 
preferably, can include a series of treatments. 

[0174] With regard to polypeptide formulations, featured herein is a method for treating 
osteoarthritis in a subject, which comprises contacting one or more cells in the subject with a first 
polypeptide, where the subject comprises a second polypeptide having one or more polymorphic 
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variations associated with cancer, and where the first polypeptide comprises fewer polymorphic 
variations associated with cancer than the second polypeptide. The first and second polypeptides are 
encoded by a nucleic acid which comprises a nucleotide sequence in SEQ ID NO: 1-7; a nucleotide 
sequence which encodes a polypeptide consisting of an amino acid sequence encoded by a nucleotide 
sequence referenced in SEQ ID NO: 1-7; a nucleotide sequence which encodes a polypeptide that is 90% 
or more identical to an amino acid sequence encoded by a nucleotide sequence of SEQ ID NO: 1-7 and a 
nucleotide sequence 90% or more identical to a nucleotide sequence in SEQ ID NO: 1-7. The subject 
often is a human. 

[0175] For antibodies, a dosage of 0.1 mg/kg of body weight (generally 10 mg/kg to 20 mg/kg) is 
often utilized. If the antibody is to act in the brain, a dosage of 50 mg/kg to 100 mg/kg is often 
appropriate. Generally, partially human antibodies and fully human antibodies have a longer half-life 
within the human body than other antibodies. Accordingly, lower dosages and less frequent 
administration is often possible. Modifications such as lipidation can be used to stabilize antibodies and 
to enhance uptake and tissue penetration (e.g. , into the brain). A method for lipidation of antibodies is 
described by Cruikshank et ai, J. Acquired Immune Deficiency Syndromes and Human Retrovirology 
14:192 (1997). 

[0176] Antibody conjugates can be used for modifying a given biological response, the drug moiety 
is not to be construed as limited to classical chemical therapeutic agents. For example, the drug moiety 
may be a protein or polypeptide possessing a desired biological activity. Such proteins may include, for 
example, a toxin such as abrin, ricin A, pseudomonas exotoxin, or diphtheria toxin; a polypeptide such as 
tumor necrosis factor, alpha-interferon, beta-interferon, nerve growth factor, platelet derived growth 
factor, tissue plasminogen activator; or, biological response modifiers such as, for example, lymphokines, 
interleukin-1 ("IL-l"), interleukin-2 ("IL-2"), interleukin-6 ("IL-6"), granulocyte macrophage colony 
stimulating factor ("GM-CSF"), granulocyte colony stimulating factor ("G-CSF"), or other growth 
factors. Alternatively, an antibody can be conjugated to a second antibody to form an antibody 
heteroconjugate as described by Segal in U.S. Patent No. 4,676,980. 

[0177] For compounds, exemplary doses include milligram or microgram amounts of the compound 
per kilogram of subject or sample weight, for example, about 1 microgram per kilogram to about 500 
milligrams per kilogram, about 1 00 micrograms per kilogram to about 5 milligrams per kilogram, or 
about 1 microgram per kilogram to about 50 micrograms per kilogram. It is understood that appropriate 
doses of a small molecule depend upon the potency of the small molecule with respect to the expression 
or activity to be modulated. When one or more of these small molecules is to be administered to an 
animal (e.g., a human) in order to modulate expression or activity of a polypeptide or nucleic acid 
described herein, a physician, veterinarian, or researcher may, for example, prescribe a relatively low 
dose at first, subsequently increasing the dose until an appropriate response is obtained. In addition, it is 
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understood that the specific dose level for any particular animal subject will depend upon a variety of 
factors including the activity of the specific compound employed, the age, body weight, general health, 
gender, and diet of the subject, the time of administration, the route of administration, the rate of 
excretion, any drug combination, and the degree of expression or activity to be modulated. 

[0178] With regard to nucleic acid formulations, gene therapy vectors can be delivered to a subject 
by, for example, intravenous injection, local administration (see, e.g., U.S. Patent 5,328,470) or by 
stereotactic injection (see e.g., Chen etal, (1994) Proc. Natl. Acad. Sci. USA 97:3054-3057). 
Pharmaceutical preparations of gene therapy vectors can include a gene therapy vector in an acceptable 
diluent, or can comprise a slow release matrix in which the gene delivery vehicle is imbedded. 
Alternatively, where the complete gene delivery vector can be produced intact from recombinant cells 
(e.g. , retroviral vectors) the pharmaceutical preparation can include one or more cells which produce the 
gene delivery system. Examples of gene delivery vectors are described herein. 

Therapeutic Methods 

[0179] A therapeutic formulation described above can be administered to a subject in need of a 
therapeutic for inducing a desired biological response.. Therapeutic formulations can be administered by 
any of the paths described herein. With regard to both prophylactic and therapeutic methods of 
treatment, such treatments may be specifically tailored or modified, based on knowledge obtained from 
pharmacogenomic analyses described herein. 

[0180] As used herein, the term "treatment" is defined as the application or administration of a 
therapeutic formulation to a subject, or application or administration of a therapeutic agent to an isolated 
tissue or cell line from a subject with the purpose to cure, heal, alleviate, relieve, alter, remedy, 
ameliorate, improve or affect osteoarthritis, symptoms of osteoarthritis or a predisposition towards 
osteoarthritis. A therapeutic formulation includes, but is not limited to, small molecules, peptides, 
antibodies, ribozymes and antisense oligonucleotides. Administration of a therapeutic formulation can 
occur prior to the manifestation of symptoms characteristic of osteoarthritis, such that osteoarthritis is 
prevented or delayed in its progression. The appropriate therapeutic composition can be determined 
based on screening assays described herein. 

[0181] As discussed, successful treatment of osteoarthritis can be brought about by techniques that 
serve to agonize target molecule expression or function, or alternatively, antagonize target molecule 
expression or function. These techniques include administration of modulators that include, but are not 
limited to, small organic or inorganic molecules; antibodies (including, for example, polyclonal, 
monoclonal, humanized, anti-idiotypic, chimeric or single chain antibodies, and Fab, F(ab')2 and Fab 
expression library fragments, scFV molecules, and epitope-binding fragments thereof); and peptides, 
phosphopeptides, or polypeptides. 
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[0182] Further, antisense and ribozyme molecules that inhibit expression of the target gene can also 
be used to reduce the level of target gene expression, thus effectively reducing the level of target gene 
activity. Still further, triple helix molecules can be utilized in reducing the level of target gene activity. 
Antisense, ribozyme and triple helix molecules are discussed above. It is possible that the use of 
antisense, ribozyme, and/or triple helix molecules to reduce or inhibit mutant gene expression can also 
reduce or inhibit the transcription (triple helix) and/or translation (antisense, ribozyme) of mRNA 
produced by normal target gene alleles, such that the concentration of normal target gene product present 
can be lower than is necessary for a normal phenotype. In such cases, nucleic acid molecules that encode 
and express target gene polypeptides exhibiting normal target gene activity can be introduced into cells 
via gene therapy method. Alternatively, in instances in that the target gene encodes an extracellular 
polypeptide, it can be preferable to co-administer normal target gene polypeptide into the cell or tissue in 
order to maintain the requisite level of cellular or tissue target gene activity. 

[0183] Another method by which nucleic acid molecules may be utilized in treating or preventing 
osteoarthritis is use of aptamer molecules specific for target molecules. Aptamers are nucleic acid 
molecules having a tertiary structure which permits them to specifically bind to ligands {see, e.g., 
Osborne, etal., Curr. Opin. Chem. Biol. 1(1): 5-9 (1997); and Patel, D. J., Curr. Opin. Chem. Biol. 
Jun;l(l): 32-46 (1997)). 

[0184] Yet another method of utilizing nucleic acid molecules for osteoarthritis treatment is gene 
therapy, which can also be referred to as allele therapy. Provided herein is a gene therapy method for 
treating osteoarthritis in a subject, which comprises contacting one or more cells in the subject or from 
the subject with a nucleic acid having a first nucleotide sequence (e.g., the first nucleotide sequence is 
identical to or substantially identical to the nucleotide sequence of SEQ ID NO: 1-7). Genomic DNA in 
the subject comprises a second nucleotide sequence having one or more polymorphic variations 
associated with osteoarthritis (e.g., the second nucleotide sequence is identical to or substantially 
identical to a nucleotide sequence of SEQ ID NO: 1). The first and second nucleotide sequences 
typically are substantially identical to one another, and the first nucleotide sequence comprises fewer 
polymorphic variations associated with osteoarthritis than the second nucleotide sequence. The first 
nucleotide sequence may comprise a gene sequence that encodes a full-length polypeptide or a fragment 
thereof. The subject is often a human. Allele therapy methods often are utilized in conjunction with a 
method of first determining whether a subject has genomic DNA that includes polymorphic variants 
associated with osteoarthritis. 

[0185] In another allele therapy embodiment, provided herein is a method which comprises 
contacting one or more cells in the subject or from the subject with a polypeptide encoded by a nucleic 
acid having a first nucleotide sequence (e.g., the first nucleotide sequence is identical to or substantially 
identical to the nucleotide sequence of SEQ ED NO: 1-7). Genomic DNA in the subject comprises a 
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second nucleotide sequence having one or more polymorphic variations associated with osteoarthritis 
(e.g., the second nucleic acid has a nucleotide sequence identical to or substantially identical to the 
nucleotide sequence of SEQ ID NO: 1). The first and second nucleotide sequences typically are 
substantially identical to one another, and the first nucleotide sequence comprises fewer polymorphic 
variations associated with osteoarthritis than the second nucleotide sequence. The first nucleotide 
sequence may comprise a gene sequence that encodes a full-length polypeptide or a fragment thereof. 
The subject is often a human. 

[0186] For antibody-based therapies, antibodies can be generated that are both specific for target 
molecules and that reduce target molecule activity. Such antibodies may be administered in instances 
where antagonizing a target molecule function is appropriate for the treatment of osteoarthritis. 

[0187] In circumstances where stimulating antibody production in an animal or a human subject by 
injection with a target molecule is harmful to the subject, it is possible to generate an immune response 
against the target molecule by use of anti-idiotypic antibodies (see, e.g., Herlyn, Ann. Med.;31(l): 66-78 
(1999); and Bhattacharya-Chatterjee & Foon, Cancer Treat. Res.; 94: 51-68 (1998)). Introducing an 
anti-idiotypic antibody to a mammal or human subject often stimulates production of anti-anti-idiotypic 
antibodies, which typically are specific to the target molecule. Vaccines directed to osteoarthritis also 
may be generated in this fashion. 

[0188] In instances where the target molecule is intracellular and whole antibodies are used, 
internalizing antibodies may be preferred. Lipofectin or liposomes can be used to deliver the antibody or 
a fragment of the Fab region that binds to the target antigen into cells. Where fragments of the antibody 
are used, the smallest inhibitory fragment that binds to the target antigen is preferred. For example, 
peptides having an amino acid sequence corresponding to the Fv region of the antibody can be used. 
Alternatively, single chain neutralizing antibodies that bind to intracellular target antigens can also be 
administered. Such single chain antibodies can be administered, for example, by expressing nucleotide 
sequences encoding single-chain antibodies within the target cell population (see, e.g., Marasco et al., 
Proc. Natl. Acad. Sci. USA 90: 7889-7893 (1993)). 

[0189] Modulators can be administered to a patient at therapeutically effective doses to treat 
osteoarthritis. A therapeutically effective dose refers to an amount of the modulator sufficient to result in 
amelioration of symptoms of osteoarthritis. Toxicity and therapeutic efficacy of modulators can be 
determined by standard pharmaceutical procedures in cell cultures or experimental animals, e.g. , for 
determining the LD 50 (the dose lethal to 50% of the population) and the ED 50 (the dose therapeutically 
effective in 50% of the population). The dose ratio between toxic and therapeutic effects is the 
therapeutic index and it can be expressed as the ratio LD50/ED50. Modulators that exhibit large 
therapeutic indices are preferred. While modulators that exhibit toxic side effects can be used, care 
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should be taken to design a delivery system that targets such molecules to the site of affected tissue in 
order to minimize potential damage to uninfected cells, thereby reducing side effects. 

[0190] Data obtained from cell culture assays and animal studies can be used in formulating a range 
of dosages for use in humans. The dosage of such compounds lies preferably within a range of 
circulating concentrations that include the ED 50 with little or no toxicity. The dosage can vary within this 
range depending upon the dosage form employed and the route of administration utilized. For any 
compound used in the methods described herein, the therapeutically effective dose can be estimated 
initially from cell culture assays. A dose can be formulated in animal models to achieve a circulating 
plasma concentration range that includes the IC 5 o (i.e., the concentration of the test compound that 
achieves a half-maximal inhibition of symptoms) as determined in cell culture. Such information can be 
used to more accurately determine useful doses in humans. Levels in plasma can be measured, for 
example, by high performance liquid chromatography. 

[0191] Another example of effective dose determination for an individual is the ability to directly 
assay levels of "free" and "bound" compound in the serum of the test subject. Such assays may utilize 
antibody mimics and/or "biosensors" that have been created through molecular imprinting techniques. 
Molecules that modulate target molecule activity are used as a template, or "imprinting molecule", to 
spatially organize polymerizable monomers prior to their polymerization with catalytic reagents. The 
subsequent removal of the imprinted molecule leaves a polymer matrix which contains a repeated 
"negative image" of the compound and is able to selectively rebind the molecule under biological assay 
conditions. A detailed review of this technique can be seen in Ansell et al, Current Opinion in 
Biotechnology 7: 89-94 (1996) and in Shea, Trends in Polymer Science 2: 166-173 (1994). Such 
"imprinted" affinity matrixes are amenable to ligand-binding assays, whereby the immobilized 
monoclonal antibody component is replaced by an appropriately imprinted matrix. An example of the 
use of such matrixes in this way can be seen in Vlatakis, et al.. Nature 361: 645-647 (1993). Through the 
use of isotope-labeling, the "free" concentration of compound which modulates target molecule 
expression or activity readily can be monitored and used in calculations of IC 50 . Such "imprinted" 
affinity matrixes can also be designed to include fluorescent groups whose photon-emitting properties 
measurably change upon local and selective binding of target compound. These changes readily can be 
assayed in real time using appropriate fiberoptic devices, in turn allowing the dose in a test subject to be 
quickly optimized based on its individual 1C 50 . An example of such a "biosensor" is discussed in Kriz et 
al., Analytical Chemistry 67: 2142-2144 (1995). 

[0192] The examples set forth below are intended to illustrate but not limit the invention. 
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Examples 

[0193] In the following studies a group of subjects was selected according to specific parameters 
relating to osteoarthritis (OA). Nucleic acid samples obtained from individuals in the study group were 
subjected to genetic analysis, which identified associations between osteoarthritis and a polymorphism in 
the APOL3 gene on chromosome twenty-two. The polymorphism was genotyped again in two 
replication cohorts consisting of individuals selected for OA. In addition, SNPs proximal to the incident 
polymorphism were identified and allelotyped in OA case and control pools. Methods are described for 
producing APOL3 polypeptide and APOL3 polypeptide variants in vitro or in vivo, APOL3 nucleic acids 
or polypeptides and variants thereof are utilized for screening test molecules for those that interact with 
APOL3 molecules. Test molecules identified as interactors with APOL3 molecules and APOL3 variants 
are further screened in vivo to determine whether they treat osteoarthritis. 

Example 1 
Samples and Pooling Strategies 

Sample Selection 

[0194] Blood samples were collected from individuals diagnosed with knee osteoarthritis, which 
were referred to as case samples. Also, blood samples were collected from individuals not diagnosed 
with knee osteoarthritis as gender and age-matched controls. A database was created that listed all 
phenotypic trait information gathered from individuals for each case and control sample. Genomic DNA 
was extracted from each of the blood samples for genetic analyses. 

DNA Extraction from Blood Samples 

[0195] Six to ten milliliters of whole blood was transferred to a 50 ml tube containing 27 ml of red 
cell lysis solution (RCL). The tube was inverted until the contents were mixed. Each tube was incubated 
for 10 minutes at room temperature and inverted once during the incubation. The tubes were then 
centrifuged for 20 minutes at 3000 x g and the supernatant was carefully poured off. 100-200 ul of 
residual liquid was left in the tube and was pipetted repeatedly to resuspend the pellet in the residual 
supernatant. White cell lysis solution (WCL) was added to the tube and pipetted repeatedly until 
completely mixed. While no incubation was normally required, the solution was incubated at 37°C or 
room temperature if cell clumps were visible after mixing until the solution was homogeneous. 2 ml of 
protein precipitation was added to the cell lysate. The mixtures were vortexed vigorously at high speed 
for 20 sec to mix the protein precipitation solution uniformly with the cell lysate, and then centrifuged for 
1 0 minutes at 3000 x g. The supernatant containing the DNA was then poured into a clean 1 5 ml tube, 
which contained 7 ml of 100% isopropanol. The samples were mixed by inverting the tubes gently until 

55 



PATENT 
Docket 524593009200 



white threads of DNA were visible. Samples were centrifuged for 3 minutes at 2000 x g and the DNA 
was visible as a small white pellet. The supernatant was decanted and 5 ml of 70% ethanol was added to 
each tube. Each tube was inverted several times to wash the DNA pellet, and then centrifuged for 1 
minute at 2000 x g. The ethanol was decanted and each tube was drained on clean absorbent paper. The 
DNA was dried in the tube by inversion for 10 minutes, and then 1000 ul of IX TE was added. The size 
of each sample was estimated, and less TE buffer was added during the following DNA hydration step if 
the sample was smaller. The DNA was allowed to rehydrate overnight at room temperature, and DNA 
samples were stored at 2-8°C. 

[0196] DNA was quantified by placing samples on a hematology mixer for at least 1 hour. DNA 
was serially diluted (typically 1 :80, 1:160, 1:320, and 1:640 dilutions) so that it would be within the 
measurable range of standards. 125 pi of diluted DNA was transferred to a clear U-bottom microtitre 
plate, and 125 ul of IX TE buffer was transferred into each well using a multichannel pipette. The DNA 
and IX TE were mixed by repeated pipetting at least 15 times, and then the plates were sealed. 50 pi of 
diluted DNA was added to wells A5-H12 of a black flat bottom microtitre plate. Standards were inverted 
six times to mix them, and then 50 pi of IX TE buffer was pipetted into well Al , 1000 ng/ml of standard 
was pipetted into well A2, 500 ng/ml of standard was pipetted into well A3, and 250 ng/ml of standard 
was pipetted into well A4. PicoGreen (Molecular Probes, Eugene, Oregon) was thawed and freshly 
diluted 1 :200 according to the number of plates that were being measured. PicoGreen was vortexed and 
then 50pl was pipetted into all wells of the black plate with the diluted DNA. DNA and PicoGreen were 
mixed by pipetting repeatedly at least 1 0 times with the multichannel pipette. The plate was placed into a 
Fluoroskan Ascent Machine (microplate fluorometer produced by Labsystems) and the samples were 
allowed to incubate for 3 minutes before the machine was run using filter pairs 485 nm excitation and 
538 nm emission wavelengths. Samples having measured DNA concentrations of greater than 450 ng/pl 
were re-measured for conformation. Samples having measured DNA concentrations of 20 ng/pl or less 
were re-measured for confirmation. 

Pooling Strategies - Discovery Cohort 

[0197] Samples were derived from the Nottingham knee OA family study (UK) where index cases 
were identified through a knee replacement registry. Siblings were approached and assessed with knee x- 
rays and assigned status as affected or unaffected. In all 1,157 individuals were available. In order to 
create same-sex pools of appropriate sizes, 335 unrelated female individuals with OA from the 
Nottingham OA sample were selected for the case pool. The control pool was made up of unrelated 
female individuals from the St. Thomas twin study (England) with normal knee x-rays and without other 
indications of OA, regardless of anatomical location, as well as lacking family history of OA. The St. 
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Thomas twin study consists of Caucasian, female participants from the St. Thomas' Hospital, London, 
adult-twin registry, which is a voluntary registry of >4,000 twin pairs ranging from 1 8 to 76 years of age. 
The female case samples and female control samples are described further in Table 1 below. 

[0198] A select set of samples from each group were utilized to generate pools, and one pool was 
created for each group. Each individual sample in a pool was represented by an equal amount of 
genomic DNA. For example, where 25 ng of genomic DNA was utilized in each PCR reaction and there 
were 200 individuals in each pool, each individual would provide 125 pg of genomic DNA. Inclusion or 
exclusion of samples for a pool was based upon the following criteria: the sample was derived from an 
individual characterized as Caucasian; the sample was derived from an individual of British paternal and 
maternal descent; case samples were derived from individuals diagnosed with specific knee osteoarthritis 
(OA) and were recruited from an OA knee replacement clinic. Control samples were derived from 
individuals free of OA, family history of OA, and rheumatoid arthritis. Also, sufficient genomic DNA 
was extracted from each blood sample for all allelotyping and genotyping reactions performed during the 
study. Phenotype information from each individual was collected and included age of the individual, 
gender, family history of OA, general medical information (e.g., height, weight, thyroid disease, 
diabetes, psoriasis, hysterectomy), joint history (previous and current symptoms, joint-related operations, 
age at onset of symptoms, date of primary diagnosis, age of individual as of primary diagnosis and order 
of involvement), and knee-related findings (crepitus, restricted passive movement, bony 
swelling/deformity). Additional knee information included knee history, current symptoms, any major 
knee injury, menisectomy, knee replacement surgery, age of surgery, and treatment history (including 
hormone replace therapy (HRT)). Samples that met these criteria were added to appropriate pools based 
on disease status. 

[0199] The selection process yielded the pools set forth in Table 1 , which were used in the studies 
that follow: 



TABLE 1 





Female case 


Female control 


Pool size 
(Number) 


335 


335 


Pool Criteria 

(ex: case/control) 


control 


case 


Mean Age 

(ex: years) 


57.21 


69.95 
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Example 2 

Association of Polymorphic Variants with Osteoarthritis 

[0200] A whole-genome screen was performed to identify particular SNPs associated with 
occurrence of osteoarthritis. As described in Example 1 , two sets of samples were utilized, which 
included samples from female individuals having knee osteoarthritis (osteoarthritis cases), and samples 
from female individuals not having knee osteoarthritis (female controls). The initial screen of each pool 
was performed in an allelotyping study, in which certain samples in each group were pooled. By pooling 
DNA from each group, an allele frequency for each SNP in each group was calculated. These allele 
frequencies were then compared to one another. Particular SNPs were considered as being associated 
with osteoarthritis when allele frequency differences calculated between case and control pools were 
statistically significant. SNP disease association results obtained from the allelotyping study were then 
validated by genotyping each associated SNP across all samples from each pool. The results of the 
genotyping then were analyzed, allele frequencies for each group were calculated from the individual 
genotyping results, and a p-value was calculated to determine whether the case and control groups had 
statistically significant differences in allele frequencies for a particular SNP. When the genotyping 
results agreed with the original allelotyping results, the SNP disease association was considered validated 
at the genetic level. 

SNP Panel Used for Genetic Analyses 

[0201] A whole-genome SNP screen began with an initial screen of approximately 25,000 SNPs 
over each set of disease and control samples using a pooling approach. The pools studied in the screen 
are described in Example 1 . The SNPs analyzed in this study were part of a set of 25,488 SNPs 
confirmed as being statistically polymorphic as each is characterized as having a minor allele frequency 
of greater than 10%. The SNPs in the set reside in genes or in close proximity to genes, and many reside 
in gene exons. Specifically, SNPs in the set are located in exons, introns, and within 5,000 base-pairs 
upstream of a transcription start site of a gene. In addition, SNPs were selected according to the 
following criteria: they are located in ESTs; they are located in Locuslink or Ensembl genes; and they are 
located in Genomatix promoter predictions. SNPs in the set were also selected on the basis of even 
spacing across the genome, as depicted in Table 2. 

[0202] A case-control study design using a whole genome association strategy involving 
approximately 28,000 single nucleotide polymorphisms (SNPs) was employed. Approximately 25,000 
SNPs were evenly spaced in gene-based regions of the human genome with a median inter-marker 
distance of about 40,000 base pairs. Additionally, approximately 3,000 SNPs causing amino acid 
substitutions in genes described in the literature as candidates for various diseases were used. The case- 
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control study samples were of female Caucasian origin (British paternal and maternal descent) 670 
individuals were equally distributed in two groups: female controls and female cases. The whole genome 
association approach was first conducted on 2 DNA pools representing the 2 groups. Significant markers 
were confirmed by individual genotyping. 



TABLE 2 



General Statistics 


Spacing Statistics 


Total # of SNPs 


25,488 


Median 


37,058 bp 


#ofExonic SNPs 


>4,335(17%) 


Minimum* 


1,000 bp 


# SNPs with refSNP ID 


20,776 (81%) 


Maximum* 


3,000,000 bp 


Gene Coverage 


>1 0,000 


Mean 


122,412 bp 


Chromosome Coverage 


All 


Std Deviation 


373,325 bp 






^Excludes outliers 





Allelotyping and Genotyping Results 

[0203] The genetic studies summarized above and described in more detail below identified an 
allelic variant in the APOL3 gene that is associated with osteoarthritis. 

Assay for Verifying. Allelotyping. and Genotyping SNPs 

[0204] A MassARRAY™ system (Sequenom, Inc.) was utilized to perform SNP genotyping in a 
high-throughput fashion. This genotyping platform was complemented by a homogeneous, single-tube 
assay method (hME™ or homogeneous MassEXTEND™ (Sequenom, Inc.)) in which two genotyping 
primers anneal to and amplify a genomic target surrounding a polymorphic site of interest. A third 
primer (the MassEXTEND™ primer), which is complementary to the amplified target up to but not 
including the polymorphism, was then enzymatically extended one or a few bases through the 
polymorphic site and then terminated. 

[0205] For each polymorphism, SpectroDESIGNER™ software (Sequenom, Inc.) was used to 
generate a set of PCR primers and a MassEXTEND™ primer which where used to genotype the 
polymorphism. Other primer design software could be used or one of ordinary skill in the art could 
manually design primers based on his or her knowledge of the relevant factors and considerations in 
designing such primers. Table 3 shows PCR primers and Table 4 shows extension primers used for 
analyzing polymorphisms. The initial PCR amplification reaction was performed in a 5 pi total volume 
containing IX PCR buffer with 1.5 mM MgCI 2 (Qiagen), 200 uM each of dATP, dGTP, dCTP, dTTP 
(Gibco-BRL), 2.5 ng of genomic DNA, 0.1 units of HotStar DNA polymerase (Qiagen), and 200 nM 
each of forward and reverse PCR primers specific for the polymorphic region of interest. 
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TABLE 3: PGR Primers 



SNP 
Reference 


Forward PCR primer 


Reverse PCR primer 


rs1 32659 


ACGTTGGATGGGCCCATAGTGGGTCATAAC 


ACGTTGGATGGTGGGGTGAGTGCCCAAAAG 



[0206] Samples were incubated at 95°C for 1 5 minutes, followed by 45 cycles of 95°C for 20 
seconds, 56°C for 30 seconds, and 72°C for 1 minute, finishing with a 3 minute final extension at 72°C. 
Following amplification, shrimp alkaline phosphatase (SAP) (0.3 units in a 2 ul volume) (Amersham 
Pharmacia) was added to each reaction (total reaction volume was 7 ul) to remove any residual dNTPs 
that were not consumed in the PCR step. Samples were incubated for 20 minutes at 37°C, followed by 5 
minutes at 85°C to denature the SAP. 

[0207] Once the SAP reaction was complete, a primer extension reaction was initiated by adding a 
polymorphism-specific MassEXTEND™ primer cocktail to each sample. Each MassEXTEND™ 
cocktail included a specific combination of dideoxynucleotides (ddNTPs) and deoxynucleotides (dNTPs) 
used to distinguish polymorphic alleles from one another. Methods for verifying, allelotyping and 
genotyping SNPs are disclosed, for example, in U.S. Pat. No. 6,258,538, the content of which is hereby 
incorporated by reference. In Table 4, ddNTPs are shown and the fourth nucleotide not shown is the 
dNTP. 



TABLE 4: Extension Primers 



SNP 
Reference 


Extend Probe 


Termination 
Mix 


rs 132659 


AGAACTCCCCAAATCGTCCT 


ACG 



[0208] The MassEXTEND™ reaction was performed in a total volume of 9 ul, with the addition of 
1 X ThermoSequenase buffer, 0.576 units of ThermoSequenase (Amersham Pharmacia), 600 nM 
MassEXTEND™ primer, 2 mM of ddATP and/or ddCTP and/or ddGTP and/or ddTTP, and 2 mM of 
dATP or dCTP or dGTP or dTTP. The deoxy nucleotide (dNTP) used in the assay normally was 
complementary to the nucleotide at the polymorphic site in the amplicon. Samples were incubated at 
94°C for 2 minutes, followed by 55 cycles of 5 seconds at 94°C, 5 seconds at 52°C, and 5 seconds 
at 72°C. 

[0209] Following incubation, samples were desalted by adding 16 ul of water (total reaction volume 
was 25 ul), 3 mg of SpectroCLEAN™ sample cleaning beads (Sequenom, Inc.) and allowed to incubate 
for 3 minutes with rotation. Samples were then robotically dispensed using a piezoelectric dispensing 
device (SpectroJET™ (Sequenom, Inc.)) onto either 96-spot or 384-spot silicon chips containing a matrix 
that crystallized each sample (SpectroCHIP™ (Sequenom, Inc.)). Subsequently, MALDI-TOF mass 
spectrometry (Biflex and Autoflex MALDI-TOF mass spectrometers (Bruker Daltonics) can be used) and 
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SpectroTYPER RT™ software (Sequenom, Inc.) were used to analyze and interpret the SNP genotype for 
each sample. 

Genetic Analysis 

[0210] Minor allelic frequencies for the polymorphisms set forth in Table A were verified as being 
10% or greater using the extension assay described above in a group of samples isolated from 92 
individuals originating from the state of Utah in the United States, Venezuela and France (Coriell cell 
repositories). 

[0211] Genotyping results are shown for female pools in Table 5. In Table 5, "AF" refers to allelic 
frequency; and "F case" and "F control" refer to female case and female control groups, respectively. 



TABLE 5: Genotyping Results 



SNP Reference 


AF 
F case 


AF 
F control 


p-value 


rs1 32659 


C = 0.675 
T = 0.325 


C = 0.589 
T = 0.411 


0.0015 



[0212] The single marker alleles set forth in Table A were considered validated, since the 
genotyping data agreed with the allelotyping data and each SNP significantly associated with 
osteoarthritis. Particularly significant associations with osteoarthritis are indicated by a calculated p- 
value of less than 0.05 for genotype results. 

Example 3 

Association of Polymorphic Variants with Osteoarthritis in Replication Cohorts 

[0213] The single marker polymorphism set forth in Table A was genotyped again in two replication 
cohorts consisting of individuals selected for OA. 

Sample Selection and Pooling Strategies — Replication Sample 1 

[0214] A second case control sample (replication sample #1) was created by using 100 Caucasian 
female cases from Chingford, UK, and 148 unrelated female cases from the St. Thomas twin study. 
Cases were defined as having Kellgren-Lawrence (KL) scores of at least 2 in at least one knee x-ray. In 
addition, 199 male knee replacement cases from Nottingham were included. (For a cohort description, 
see the Nottingham description provided in Example 1). The control pool was made up of unrelated 
female individuals from the St. Thomas twin study (England) with normal knee x-rays and without other 
indications of OA, regardless of anatomical location, as well as lacking family history of OA. The St. 
Thomas twin study consists of Caucasian, female participants from the St. Thomas' Hospital, London, 
adult-twin registry, which is a voluntary registry of >4,000 twin pairs ranging from 1 8 to 76 years of age. 
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The replication sample 1 cohort was used to replicate the initial results. Table 6 below summarizes the 
selected phenotype data collected from the case and control individuals. 



TABLE 6 



Phenotype 


Female cases (n=248): 
median (range)/ (n,%) 


Male cases (n=199): 
median (range)/ (n,%) 


Female controls (n=313): 

mean (range)/ (n,%) 


Age 


59 (39- 73) 


66 (45- 73) 


55 (50- 72) 


Height (cm) 


162(141- 178) 


175(152- 198) 


162 (141- 176) 


Weight (kg) 


68 (51-123) 


86(62-127) 


64 (40-111) 


Body mass index 
(kg/m 2 ) 


26(18-44) 


29 (21-41) 


24(18-46) 


Kellgren- 
Lawrence* left 
knee 


0 (63, 26%), 1 (20, 8%), 2 
(105, 43%), 3(58, 23%), 4 
(1,0%) 


NA 


NA 


Kellgren- 
Lawrence* right 
knee 


0(43, 7%), 1 (18, 7%), 2 
(127, 52%), 3(57, 23%), 4 
(1,0%) 


NA 


NA 


KL* >2 both 
knees 


No (145, 59%), Yes (101, 
41%) 


NA 


NA 


KL* >2 either 
knee 


No (0, 0%), Yes (248, 100%) 


NA 


NA 



* 0: normal, 1 : doubtful, 2: definite osteophyte (bony protuberance), 3: joint space narrowing (with or without 
osteophyte), 4: joint deformity 



Sample Selection and Pooling Strategies - Replication Sample 2 

[0215] A third case control sample (replication sample #2) was created by using individuals with 
symptoms of OA from Newfoundland, Canada. These individuals were recruited and examined by 
rheumatologists. Affected joints were x-rayed and a final diagnosis of definite or probable OA was made 
according to American College of Rheumatology criteria by a single rheumatologist to avoid any inter- 
examiner diagnosis variability. Controls were recruited from volunteers without any symptoms from the 
musculoskeletal system based on a normal joint exam performed by a rheumatologist. Only cases with a 
diagnosis of definite OA were included in the study. Only individuals of Caucasian origin were included. 
The cases consisted of 228 individuals with definite knee OA, 106 individuals with definite hip OA, and 
74 individuals with hip OA. 



TABLE 7 



Phenotype 


Case 


Control 


Age at Visit 


62.7 


52.5 


Sex (Female/Male) 


227/119 


174/101 


Knee OA Xray: No 


35% (120) 


80% (16) 
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Unknown 


1%(4) 


0% (0) 


Yes 


64% (221) 


20% (4) 


Hip OA Xray: No 


63% (215) 


80% (16) 


Unknown 


2% (7) 


0% (0) 


Yes 


35% (121) 


20% (4) 



Assay for Verifying. Allelotvping. and Genotvping SNPs 

[0216] Genotyping of the replication cohorts described in Tables 6 and 7 was performed using the 
same methods used for the original genotyping, as described herein. A MassARRAY™ system 
(Sequenom, Inc.) was utilized to perform SNP genotyping in a high-throughput fashion. This genotyping 
platform was complemented by a homogeneous, single-tube assay method (hME™ or homogeneous 
MassEXTEND™ (Sequenom, Inc.)) in which two genotyping primers anneal to and amplify a genomic 
target surrounding a polymorphic site of interest. A third primer (the MassEXTEND™ primer), which is 
complementary to the amplified target up to but not including the polymorphism, was then enzymatically 
extended one or a few bases through the polymorphic site and then terminated. 

[0217] For each polymorphism, SpectroDESIGNER™ software (Sequenom, Inc.) was used to 
generate a set of PCR primers and a MassEXTEND™ primer which where used to genotype the 
polymorphism. Other primer design software could be used or one of ordinary skill in the art could 
manually design primers based on his or her knowledge of the relevant factors and considerations in 
designing such primers. Table 3 shows PCR primers and Table 4 shows extension probes used for 
analyzing (e.g., genotyping) polymorphisms in the replication cohorts. The initial PCR amplification 
reaction was performed in a 5 ul total volume containing IX PCR buffer with 1.5 mM MgCl 2 (Qiagen), 
200 uM each of dATP, dGTP, dCTP, dTTP (Gibco-BRL), 2.5 ng of genomic DNA, 0.1 units of HotStar 
DNA polymerase (Qiagen), and 200 nM each of forward and reverse PCR primers specific for the 
polymorphic region of interest. 

[0218] Samples were incubated at 95°C for 15 minutes, followed by 45 cycles of 95°C for 20 
seconds, 56°C for 30 seconds, and 72°C for 1 minute, finishing with a 3 minute final extension at 72°C. 
Following amplification, shrimp alkaline phosphatase (SAP) (0.3 units in a 2 ul volume) (Amersham 
Pharmacia) was added to each reaction (total reaction volume was 7 ul) to remove any residual dNTPs 
that were not consumed in the PCR step. Samples were incubated for 20 minutes at 37°C, followed by 5 
minutes at 85°C to denature the SAP. 

[0219] Once the SAP reaction was complete, a primer extension reaction was initiated by adding a 
polymorphism-specific MassEXTEND™ primer cocktail to each sample. Each MassEXTEND™ 
cocktail included a specific combination of dideoxynucleotides (ddNTPs) and deoxynucleotides (dNTPs) 
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used to distinguish polymorphic alleles from one another. Methods for verifying, allelotyping and 
genotyping SNPs are disclosed, for example, in U.S. Pat. No. 6,258,538, the content of which is hereby 
incorporated by reference. In Table 7, ddNTPs are shown and the fourth nucleotide not shown is the 
dNTP. 

[0220] The MassEXTEND™ reaction was performed in a total volume of 9 ul, with the addition of 
IX ThermoSequenase buffer, 0.576 units of ThermoSequenase (Amersham Pharmacia), 600 nM 
MassEXTEND™ primer, 2 mM of ddATP and/or ddCTP and/or ddGTP and/or ddTTP, and 2 mM of 
dATP or dCTP or dGTP or dTTP. The deoxy nucleotide (dNTP) used in the assay normally was 
complementary to the nucleotide at the polymorphic site in the amplicon. Samples were incubated at 
94°C for 2 minutes, followed by 55 cycles of 5 seconds at 94°C, 5 seconds at 52°C, and 5 seconds 
at 72°C. 

[0221] Following incubation, samples were desalted by adding 16 ul of water (total reaction volume 
was 25 pi), 3 mg of SpectroCLEAN™ sample cleaning beads (Sequenom, Inc.) and allowed to incubate 
for 3 minutes with rotation. Samples were then robotically dispensed using a piezoelectric dispensing 
device (SpectroJET™ (Sequenom, Inc.)) onto either 96-spot or 384-spot silicon chips containing a matrix 
that crystallized each sample (SpectroCHIP™ (Sequenom, Inc.)). Subsequently, MALDI-TOF mass 
spectrometry (Biflex and Autoflex MALDI-TOF mass spectrometers (Bruker Daltonics) can be used) and 
SpectroTYPER RT™ software (Sequenom, Inc.) were used to analyze and interpret the SNP genotype for 
each sample. 

Genetic Analysis 

[0222] Genotyping results for replication cohorts #1 and #2 are provided in Tables 8 and 9, 
respectively. 

TABLES 



rsID 


Replication #1 
(Mixed Male/Female cases and Female controls) 

AF OA Con AFOACas Delta P- value 


Meta-analysis 
Disc. + Rep #1 
P-value 


rs1 32659 


0.38 


0.34 


0.04 


0.128 


0.0077 



TABLE 9 



rsID 


Replication #2 (Newfoundland) 
(Male/Female cases and controls) 

AFOACon AFOACas Delta P-value 


Meta-analysis 
Disc, -i- Rep #2 
Not Done 


rs1 32659 


0.36 


0.36 


-0.001 


0.973 
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[0223] To combine the evidence for association from multiple sample collections, a meta-analysis 
procedure was employed. The allele frequencies were compared between cases and controls within the 
discovery sample, as well as within the replication cohort #1 using the DerSimian-Laird approach 
(DerSimonian, R. and N. Laird. 1986. Meta-analysis in clinical trials. Control Clin Trials 7: 177-188.) 

[0224] The absence of a statistically significant association in one or more of the replication cohorts 
should not be interpreted as rrurrimizing the value of the original finding. There are many reasons why a 
biologically derived association identified in a sample from one population would not replicate in a 
sample from another population. The most important reason is differences in population history. Due to 
bottlenecks and founder effects, there may be common disease predisposing alleles present in one 
population that are relatively rare in another, leading to a lack of association in the candidate region. 
Also, because common diseases such as arthritis-related disorders are the result of susceptibilities in 
many genes and many environmental risk factors, differences in population-specific genetic and 
environmental backgrounds could mask the effects of a biologically relevant allele. For these and other 
reasons, statistically strong results in the original, discovery sample that did not replicate in one or more 
of the replication samples may be further evaluated in additional replication cohorts and experimental 
systems. 

Example 4 
APOL3 Region Proximal SNPs 

[0225] It has been discovered that SNP rsl 32659 in APOL3 is associated with occurrence of 
osteoarthritis in subjects. APOL3 belongs to the high density lipoprotein family that plays a central role 
in cholesterol transport. The cholesterol content of membranes is important in cellular processes such as 
modulating gene transcription and signal transduction both in the adult brain and during 
neurodevelopment. It has been shown that the APOL1-APOL4 gene cluster on chromosome 22 exists 
only in primates (humans and African green monkeys) and not in dogs, pigs, or rodents, suggesting that 
this gene cluster has arisen recently in evolution (Monajemi et. al., Genomics 79: 539-546, 2002). Six 
transcript variants encoding three different isoforms have been identified. 

[0226] Expression of this gene is upregulated by tumor necrosis factor-alpha in endothelial cells 
lining the normal and atherosclerotic iliac artery and aorta (Horrevoets et. al., Blood 93: 3418-3431, 
1999). APOL3 is genetically linked to OA and may play a role in the pathophysiology of OA brought 
about by inflammation. APOL3 is likely inhibited by a small molecule inhibitor or by specific 
antibodies. APOL3 activity may be increased in a subject by administering APOL3 recombinant protein 
or a functional fragment thereof. 

[0227] Two hundred-nineteen additional allelic variants proximal to rsl 32659 were identified and 
subsequently allelotyped in osteoarthritis case and control sample sets as described in Examples 1 and 2. 
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The polymorphic variants are set forth in Table 10. The chromosome positions provided in column four 
of Table 10 are based on Genome "Build 34" of NCBI's GenBank. 

TABLE 10 
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dbSNP 
rs# 


Ch romosome 


Position in SEQ 
ID NO: 1 


Chromosome 
Position 


Allele 
Variants 


rs1 32622 


22 
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dbSNP 


Ch romosome 


Position in SEQ 


Chromosome 


Allele 


rs# 


ID NO: 1 


Position 


Variants 
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22 
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dbSNP 


Chromosome 


Position in SEQ 


Chromosome 


Allele 


rs# 




ID NO: 1 


Position 


Variants 
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22 


76027 


34857377 


a/g 


rs2227167 


22 


77949 


34859299 


a/g 


rs2227168 


22 


77974 


34859324 


c/t 


rs1 32700 


22 


78167 


34859517 


c/t 


rs3075364 


r 22 


78310 


34859660 


-let 


rs2227169 


22 


78415 


34859765 


c/t 


rs2097466 


22 


78575 


34859925 


c/t 


rs2097467 


22 


78590 


34859940 


c/t 


rs24 13387 


22 


78709 


34860059 


c/t 


rs1 32701 


22 


78875 


34860225 


c/t 


rs1 32702 


22 


79864 


34861214 


c/t 


rs1 32703 


22 


81316 


34862666 


c/t 


rs2269595 


22 


81320 


34862670 


a/g 


rs2269596 


22 


81409 


34862759 


c/t 


rs1 32704 


22 


81737 


34863087 


c/t 


rs2007468 


22 


81843 


34863193 


a/g 


rs1 32705 


22 


82102 


34863452 


c/t 


rs2007706 


22 


82833 


34864183 


c/t 


rs1 32706 


22 


83461 


3486481 1 


c/t 


rs1 32707 


22 


83624 


34864974 


c/t 


rs1 32708 


22 


83660 


34865010 


c/g 


rs1 32709 


22 


83701 


34865051 


a/t 


rs1 32710 


22 


83708 


34865058 


a/g 


rs1 32711 


22 


83782 


34865132 


c/t 


rs132712 


22 


85707 


34867057 


a/g 


rs132713 


22 


85717 


34867067 


a/q 


rs132714 


22 


86486 


34867836 


c/t 


rs132716 


22 


86833 


34868183 


a/g 


rs132717 


22 


87115 


34868465 


c/t 


rs132718 


22 


87234 


34868584 


a/g 


rs132719 


22 


87479 


34868829 


q/t 


rs1 32720 


22 


87561 


3486891 1 


a/a 


rs1 32721 


22 


87604 j 


34868954 


a/g 


rs1 32722 


22 


87674 


34869024 


c/t 
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dbSNP 
rs# 


Chromosome 


Position in SEQ 
ID NO: 1 


Chromosome 
Position 


Allele 
Variants 


rs1 32723 


22 


87958 


34869308 


a/q 


rs1 32724 


22 


87992 


34869342 


-/q 


rs1 32725 


22 


88019 


34869369 


a/q 


rs1 32726 


22 


88074 


34869424 


c/q 


rs1 32727 


22 


88079 


34869429 


c/q 


rs1 32728 


22 


88115 


34869465 


a/q 


rs1 32729 


22 


88118 


34869468 


c/q 


rs1 32730 


22 


88120 


34869470 


a/q 


rs1 32731 


22 


88135 


34869485 


-/ctcat 


rs1 32732 


22 


88142 


34869492 


q/t 


rs 132733 


22 


88143 


34869493 


q/t 


rs1 40575 


22 


88149 


34869499 


aca/tq 


rs1 32734 


22 


88340 


34869690 


a/q 


rs1 32735 


22 


88344 


34869694 


q/t 


rs80587 


22 


88512 


34869862 


c/q 


rs1 32736 


22 


88521 


34869871 


c/t 


rs1 32737 


22 


88650 


34870000 


c/q 


rs1 32738 


22 


88827 


34870177 


c/t 


rs1 807673 


22 


89230 


34870580 


a/g 


rs20 14700 


22 


89236 


34870586 


a/q 


rs1 32739 


22 


90754 


34872104 


q/a 


rs1812023 


22 


90984 


34872334 


a/q 


rs1812024 


22 


91110 


34872460 


a/q 


rs2005590 


22 


92026 


34873376 


c/t 


rs1 32740 


22 


92954 


34874304 


c/t 


rs3986001 


22 


93375 


34874725 


-/ttgc 


rs24 13390 


22 


93794 


34875144 


c/t 


rs 132743 


22 


94937 


34876287 


c/q 


rs1 32744 


22 


95068 


34876418 


c/t 


rs2413391 


22 


96188 


34877538 


a/q 


rs1 32749 


22 


97092 


34878442 


c/t 


rs1 32750 


22 


98812 


34880162 


c/t 


rs1 32741 


22 


not mapped 


not mapped 


a/c 


rs2413388 


22 


not mapped 


not mapped 


a/t 


rs24 13389 


22 


not mapped 


not mapped 


c/q 



Assay for Verifying and Allelotyping SNPs 

[0228] The methods used to verify and allelotype the 219 proximal SNPs of Table 10 are the same 
methods described in Examples 1 and 2 herein. The primers and probes used in these assays are provided 
in Table 1 1 and Table 12, respectively. 



TABLE 11 



dbSNP 


Forward 


Reverse 


rs# 


PCR primer 


PCR primer 
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dbSNP 

rs# 


Forward 
PCR primer 


Reverse 
PCR primer 


rs3888818 


ACGTTGGATGTGAGGTCAGGAGTTTGAGAC 


ACGTTGGATGCCATGCCCAGCTAATTTTCG 


rs20 10605 


ACGTTGGA 1 G 1 G 1 GC I 1 1 1 IAIGICI IAGG 


ACGTTGGATGGACTTTTAGAAGAAAAGTAC 


rs743919 


ACGTTGGATGTTCTTCACCAAGCCCTCTTC 


ACGTTGGATGCCCAACACACACAAAGATGG 


rs1008134 


ACGTTGGATGACATATCCGGGCATCTTTTC 


AC GTTGG ATG C ATCC AC AG G ATG C AAT ATC 


rs1 32607 


ACGTTGGATGTAGTTTGCAGGTCACAAGGG 


ACGTTGGATGGGAAGGAAGACCCAAACAGC 


rs1 476029 


ACGTTGGATGTGGGTGACAGAGATCCTTAC 


ACGTTGGATGAAACTCAGGGAAACGGACTC 


rs1 476030 


ACGTTGGATGATCCTAGCACTGGGATTTGG 


ACGTTGGATGCGTTTCCCTGAGTTTCACTG 


rs2413380 


ACGTTGGATGATGATACTGAGTCCAGGAGG 


AC GTTG G ATG AAAG GCTACTTCTTG CTC AC 


rs2051609 


ACGTTGGATGTAATCCCAGCACTTTGGGAG 


ACGTTGGATGACAGACGGGGTTTCATCATG 


rs2413381 


ACGTTGGATGAGGGCTGCAGTAGAAAAGCG 


ACGTTGGATGACGATGCGTGTGCCGACAG 


rs1 894604 


ACGTTGGATGAAGTGCTGCTGCAAAAAGAG 


ACGTTGGATGTTCTCCACTTCCATTCTGTG 


rs1 894605 


ACGTTGGATGTGATGAGAGATGCAGATCGC 


ACGTTGGATGTATTCAGAATTCAGCCTGCG 


rs1 32609 


ACGTTGGATGGCATTTGAAAGGTCCGTATC 


ACGTTGGATGCCCAAATCTGTCTTTTAGCC 


rs1 32610 


ACGTTGGATGTTCTACAAGAGCTAGGGACC 


ACGTTGGATGGATCTATTGCTGCTTAGGCC 


rs1 32611 


ACGTTGGATGGCCTTCTTTACTCTGTCCTC 


ACG I 1 GGA1 GGG 1 1 1 1 C I 1 1 CAGG 1 CC 1 CC 


rs132612 


ACGTTGGATGAAAAATCTTCCCGCTACCTC 


ACGTTGGATGTTCTGTGAGCTTTCTCTCTG 


rs1 008790 


ACGI IGGAIGGIGAGGICCCI 1 I IA1GAIG 


AC GTTG G ATG ATAAC AG CCC CTG AC AG ATG 


rs23085 


ACGTTGGATGAAACCGTGCCAGCTGAGGAT 


ACGTTGGATGGTCGGCAAGGAAGAGGAATC 


rs105161 


ACGTTGGATGAAGGAGCGGGAAATCTTTTG 


ACGTTGGATGGTAGGAGGCTGAAATGCTAG 


rs132613 


ACGTTGGATGATGTAAAACCAATGGCCTCC 


ACGTTGG ATG AAG CTTC AG ATTGTTCACCC 


rs132614 


ACGTTGGATGCAAGAGCCCTGCTTTGTGAG 


ACGTTGGATGTCCTGCACCAGCAGAGATGA 


rs132615 


ACGTTGGATGAAATCTGGAGGCTTGGTGAC 


ACGTTGGATGTGAGCATTCACATGGGACAG 


rs132617 


ACGTTGGATGGAGAAGAAGAGTGTGTGTGC 


ACGTTGGATGACAGCCACCTGAATTTGTGC 


rs3865724 


ACGTTGGATGTTCCTGGATAATTCCCATTC 


ACGTTGGATGGCTGGATCACTGAAGAAGTA 


rs2019657 


ACGTTGGATGACGCCAGAACATTGTGTTTC 


ACGTTGGATGGTGCCAGAAACATTCAAAGC 


rs3865725 


ACGTTGGATGAATATAGAACTGCTGGGCGC 


ACGTTGGATGTGACTTAGGAGAGGTTCTGG 


rs2019364 


AC GTTG G ATGTG ACTTAG G AG AG GTTC TG G 


ACGTTGGATGAATATAGAACTGCTGGGCGC 


rs2008383 


ACGTTGGATGTGATTCTAGGAGCAGGACTG 


ACGTTGGATGACATGGGTGACCCTATCAAG 


rs3986002 


ACGTTGGATGCTTCTGTCCTTCTCTGTGTC 


ACGTTGGATGCAGGCAGAGGATTTGTTTGG 


rs3888942 


ACGI IGGAIGCIGGGCI 1 1 IGIGCIAAGAG 


ACGTTGGATGGGGCCAATTTGCCCCATAAA 


rs3888943 


ACGI IGGAIGCIGGGCI 1 1 IGIGCIAAGAG 


ACGTTGGATGTTGGGCCAATTTGCCCCATA 


rs3888944 


ACGTTGGATGATACAGCCCTTGCCACTATG 


ACGTTGGATGTTGAAGACATGGAAGCAGGG 


rs132618 


ACGTTGGATGAATCCGTGCCATCAGGCAAG 


ACGTTGGATGCCTGCAATCGTTCTCTCTGC 


rs132619 


ACGTTGGATGTCATCAGCAGAAGCTGAAGC 


ACGTTGGATGG GTGTG ATGTC AC G C ATAAC 


rs3827346 


ACGTTGGATGAAGAGGTCCACAGAGGCTG 


ACGTTGGATGAAAACAAGACCAGCAAGGGA 


rs1 32620 


ACGTTGGATGTCACATTAGATCAGGAAGCC 


ACGTTGGATGTTAGGCCAGTTTAGCAGAAA 


rs1 32621 


ACGTTGGATGCTTCAAATCTGCAACTGGTG 


ACGTTGGATGGATAGCTTAAGGACTCAGAG 


rs80575 


ACGTTG G ATG G CTG C AC ATG AACTCTC AAG 


ACGTTGGATGTGACATGTGACAGTGAGACC 


rs80576 


ACGTTGGATGTGAAGCTGTCACCTGCTAAG 


ACGTTGGATGAACTCTCAAGCCACTTGACG 


rs80577 


ACGTTGGATGAGTCCATAAGAGGTTCCATG 


ACGTTGGATGAACTAATGCCTTAGCAGGTG 


rs80578 


ACGTTGGATGACTGTTTCCCTGACAGCATG 


ACGTTGGATGTGTAGAACAGAAGAGGGTCC 


rs80579 


ACGTTGGATGTGGGAGTAGGGTGAGAAGAG 


AC GTTGG ATG ACTCACTGGTCCTCTGC AAG 


rs80580 


ACGTTGGATGTTCAATCAGATGGGCGTGTG 


ACGTTGGATGGATGGCATCATGCTACTTGG 


rs1 32622 


ACGTTGGATGTATGTCTTGGAGACTGGGAC 


ACGTTGGATGACCTGCTGTTCATTCTCAGG 


rs1 32623 


ACGTTGGATGAGCTCTGTCCAACTCCATTC 


ACGTTGGATGCTGAGGAACTGCACAAACAC 


rs1 32624 


ACGTTGGATGTGCTGGGATTACAGGCATGA 


ACGTTGGATGTCAAAGAAAGTCCTGCTGGG 


rs1 32625 


ACGTTGGATGTTTCACGCCATTCTCCTGCC 


ACGTTGGATGCGATGAAACCCCGTCTGTAC 
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dbSNP 
rs# 


Forward 
PCR primer 


Reverse 
PCR primer 


rs 132626 


ACGTTGGATGTGGAGTGCAGTGGCATGATC 


ACGTTGGATGGCAGGAGAATGGCGTGAAAC 


rs1 32627 


ACGTTG G ATG AG G AG AATG G CGTG AAACC G 


ACGTTGGATGAGACAGAGTCTTGCTTGTCC 


rs1 807672 


ACGTTGGATGGTGTGCTACAGCCTAAATGG 


ACGTTGGATGAATACCCCATGTGACAGCTG 


rs1 32628 


ACGTTGGATGTATAGACTGAGTTGTGTGCC 


AC GTTG G ATGTC CTTAAAG G CTC AATCTCC 


rs1 32629 


ACGTTGGATGCTCTCTCCCTGTCTCTCTTT 


ACGTTGGATGTGTGTCCTCACATGGCCTTC 


rs1 32630 


ACGTTGGATGTTCCAAGGTGAAGGTGCCAG 


ACGTTGGATGAAGGCCATGTGAGGACACAG 


rs1 32631 


ACGTTGGATGGGTGGCTCCAACAACTGATG 


ACGTTGGATGATCAACCCTGCTGGCACCTT 


rs1 32632 


ACGI IGGAIGCI IGGAAI I I I IGCCICCAG 


ACGTTG G ATGTC AG G ATG C CTTAGT AAAAC 


rs1 32633 


ACGTTGGATGAGAAGAGTGATTCACCAGGG 


ACGTTGGATGGGAAAGCTCACTTTCTGGTG 


rs 132634 


ACGTTGGATGAAGTGCCATGGTGCTTTGTG 


ACGTTGGATGGAAAGCATGGTGGAAAGCTC 


rs 132635 


ACGTTG G ATG AATAG G C AC ATG GC AG AAG G 


ACGTTGGATGCACCAGAAAGTGAGCTTTCC 


rs 132636 


ACGTTGGATGAAGCGTTTGACAATAGGCAC 


ACGTTGGATGAAAGTGAGCTTTCCACCATG 


rs1 29603 


ACGTTGGATGGTGTCATATTGACACAGATTG 


ACGTTGGATGAGGGTGTATATATATATACCC 


rs1 32637 


ACGTTGGATGGCATCTTAGTACACAGCAGG 


ACGTTGGATGTTCCCAAATCCCTGCAAACC 


rs3788518 


ACGTTGGATGAATCCTTCAGAAGGGCTTGG 


ACGTTGGATGGCCGCGTTATTAAACCACAG 


rs1 32638 


ACGTTGGATGCATCCTTTCAGTGAAGGAGG 


ACGTTGGATGTTGCCAAGGCAACTCAGTGA 


rs1 32639 


ACGTTGGATGACACCTGGGCAAACAAAAGC 


ACGTTGGATGAAGTTCCCCATAGTTGGCAG 


rs1 32640 


ACGTTGGATGTAAGAAGCTCCAGGTGACAC 


ACGTTGGATGAAAAGAGTGACTCAGCGTCC 


rs1 32641 


ACGTTGGATGAGGGTCAGCTGGGAGCAGA 


ACGTTGGATGAGGGCTGAGAGAGGAGGTTG 


rs1 32642 


ACGTTGGATGAAGAAGCAAGCCTACCTGAG 


ACGTTGGATGAAACGAACCCTTCCAGTCAG 


rs1 32643 


ACGTTGGATGATCACAGACACCCAGTACAC 


ACGTTGGATGACGTTCTGACAATGACCTGG 


rs1 32644 


ACGTTGGATGGCATAGAGTGCAAGACACAG 


ACGTTGGATGGGGCTCCACTCCCTTAAATA 


rs1 32645 


ACGTTGGATGTGAAGGCAAACAGTACAAGA 


ACGTTGGATGAAGTTAACCAAGTGTTTAC 


rs2017329 


ACGTTGGATGCCTTCCCAATTAAAAGCAGC 


ACGTTGGATGGGGCAACAAGAGTGAAATTC 


rs739198 


ACGTTGGATGAAACTTTGGTCTCCACAACC 


ACGTTGGATGTGAGTTTGTCTAAAGACCGG 


rs1 32647 


ACGTTGGATGCCTCACTACAGAAACCATGG 


ACGTTGGATGAACTCAACTGGTTCAACCAC 


rs2097465 


ACGTTGGATGGAATTGACCAAACTGCAGGC 


ACGTTGGATGAGGGTTGAAGCTGGATACTG 


rs2105915 


ACGTTGGATGAACCCAGGAGTTCAGGACAA 


ACGTTGGATGGGGAACTACAAGTGCATCAC 


rs1 32648 


ACGTTGGATGGTGGCTCAGGGCTGTAATTC 


ACGTTGGATGTGTCCTGAACTCCTGGGTTC 


rs1 32649 


ACGTTGGATGGTCCTCCCCAGTCTTATTAC 


ACGTTGGATGATTCAGAGGTTAGCTGGCTC 


rs1 32650 


ACGTTGGATGAAAGTGCTGGGATTACAGGC 


ACGTTGGATGCTAAATCTCCTGCCATAGGG 


rs1 32651 


ACGTTGGATGAGGTCAGGTGTTGACCTTCC 


ACGTTGGATGAGCAGGGTAGGGCATCCTAA 


rs1 32652 


ACGTTGGATGAGCAGGGTAGGGCATCCTAAC 


ACGTTGGATGAGGTCAGGTGTTGACCTTCC 


rs80584 


ACGTTGGATGCATGGAGTCCTGTGATCTAC 


ACGTTGGATGAAACTGAGGCCATGGGAGAT 


rs 132653 


ACGTTGGATGCAGCCGTGCATCTGCATAAT 


ACG I I GGA I GCAC I I I CCC I I I I GGG I I CC 


rs916334 


ACGTTGGATGAAACAGGATGCTTCCCAGCC 


ACGTTGGATGCTGCTCTTGGATCAGCAGGA 


rs1 32654 


ACGTTGG ATGTAAGG AAGTGTCC AGAAG C C 


ACGTTGGATGAAATGATCCTCCTGCCTCAG 


rs1 32655 


ACGTTGGATGACTTTTCCAGGTGAAGGTAG 


ACGTTGGATGTTGTCGAACGCCATACCTGT 


rs1 32656 


ACGTTGGATGTCACTAGAAGCAAGGAACCC 


ACGTTGGATGCCCAGGTTGACTGAACAAAG 


rs3834684 


ACGTTGGATGAGCCCTTGTTCACTAGAAGC 


ACGTTGGATGGGTGGATGTGGGAGTAAAAG 


rs 132657 


ACGTTGGATGGGCTGACTGACAATTACCTG 


ACG I I GGA I GAGGGC I C I GAGC I I I I CAAG 


rs916335 


ACGTTGGATGGATGACGAGAAAAAGGTGGG 


ACGTTGGATGATTTGAAGGATGCAGTCTTG 


rs1 32659 


ACGTTGGATGGGCCCATAGTGGGTCATAAC 


ACGTTGGATGGTGGGGTGAGTGCCCAAAAG 


rs1 32660 


ACGTTGGATGTACATGTGGTTGTACCCTCC 


ACGTTGGATGCTGGCATGGTTTTACCCATC 


rs1 32661 


ACGTTGGATGCTTCGCAGAAATCATTCCGC 


ACGTTGGATGCCAAAATGCAAGCTCAAGGC 


rs1 32662 


ACGTTGGATGCATCTCTTAAATGGGCCAGG 


ACGTTGGATGTTGAAAGCCACAGCCTCATG 


rs1 32663 


ACGTTGGATGCCACTAAACGGATTGAGATC 


ACGTTGGATGTGGCTTTCAACCAGCAACTC 
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dbSNP 
rs# 


Forward 
PCR primer 


Reverse 
PCR primer 


rs1 32664 


ACGTTGGATGATCATGCCACTGCAATCCAG 


ACGTTGGATGGCATATGTGACTGCTTCCTC 


rs 132667 


ACGTTGGATGCTGGAGAAATCAAATAGAGAG 


ACGTTGGATGTGTACAGCTTTTGACAGTTG 


rs1 32670 


ACGTTGGATGTAAGGTCGGGAGTTCAAGAC 


ACGTTGGATGACGCCCGGCTGATTTTGTAT 


rs1 32671 


ACGTTGGATGGTGAGCCATACCATCACATC 


ACGTTG G ATG CTGTAGTAAAG GTCTG GTC G 


rs 132672 


ACGTTGGATGCTCCCCAATAAGCTCAACAC 


ACGTTGGATGCTGTTAGGGCAATGAAAGGC 


rs1 32673 


ACGTTGGATGTGAGTAGTTGGTGTGAGTGG 


ACGTTGGATGCAATGGATGAAGCTGATCCC 


rs1 32674 


ACGTTGGATGAACTGTAGTCCCAGCTACTC 


ACGTTGGATGTAGCTCTATCACTCATGCTG 


rs 132675 


ACGTTGGATGGTAGAGCAGATGTGCAATGG 


ACGTTGGATGTCCTAACCATCTGCCTTGTG 


rs80585 


ACGTTGGATGCTGTTGTTCCAACACTTCAC 


AC GTTG GATGG GTCTG CTACTAGAATTC AG 


rs80586 


ACGTTGGATGGTAAGTGTAAGAAGGTCTGC 


ACGTTGGATGCAAGGCATAATATTCTGACC 


rs 132676 


ACGTTGGATGCAAACATTCTGCAGAAAGCG 


AC GTTG G ATG AAG CGTGTTG CTG AG AAATG 


rs1 32677 


ACGTTGGATGCTCTGTTACAAAATGAAGGG 


ACGTTGGATGGCTATCTAGGCTAAAAATCCC 


rs1 32678 


AC GTTGG ATG AAG G C ACTG AAAATGCCTAG 


ACGTTGGATGGGAATCCAGATGCTTACATG 


rs 132680 


ACGTTGGATGGCCTTAGCTATCATGTTCTC 


ACGTTGGATGGCGTGTTTAAGGCAATTCTC 


rs1 32681 


ACGTTGGATGTACTGAAGCCTGAGACTAGC 


ACGTTGGATGCTAGCAGAAACTAACCGAGC 


rs1 32683 


ACGTTGGATGTTACCCTATGGTAATGGCAG 


ACGTTGGATGACTGATTAGTACAGGAAGGG 


rs2269594 


ACGTTGGATGTGGCATGGCTAAAAGGACAG 


ACGTTGGATGGATTGTTCTGATGCCCAGTG 


rs 132684 


ACGI IGGAIGCCI I I IAGCCAIGCCAI ICG 


ACGTTGGATGTCAGTGTAAAACGTGCCACC 


rs 132685 


ACGTTGGATGTCAGTGTAAAACGTGCCACC 


ACGTTGGATGCCTTTTAGCCATGCCATTCG 


rs1 32686 


ACGTTGGATGCAGAATATCCACGTCAGGTG 


ACGTTGGATGGACAGCTTAGGACTATGTGC 


rs1 32687 


ACGTTGGATGCAACTGTAAGCAGCCCATTG 


ACGTTGGATGCTGACGGTGCAAATGGATAC 


rs1 32688 


ACGTTGGATGAGTACTACAGGACGTGCTTG 


ACGTTGGATGGGTCGCCTCATATATGGTAG 


rs1 32689 


ACGTTGGATGTACTGGGACAGTCTGCTTTC 


ACGTTGGATGACTTTACAGTGCTGGAGCAG 


rs1 32690 


ACGI IGGAIGIGI I I IGCI I IGCGCICICC 


ACGTTGGATGTCTGCAACCAACTCTTTGGG 


rs1 32691 


ACGTTGGATGGTCAAAGCCAGGCATTTGTC 


ACGTTGGATGCTGTCATCTTGTGGAAAGGG 


rs1 32692 


ACGTTGGATGGAATCTAAGCCAGCTGTTGG 


ACGTTGGATGGGAGCATCATGTGGATTCCT 


rs1 32693 


ACGTTGGATGGCCAGAAGAAAAGAGTGTGG 


ACGTTGGATGATTCTGCATGTGGAACGTCC 


rs1 32694 


ACGTTGGATGATAGAGACTGAGAGCTGCAG 


ACGTTGGATGCAGAACAAAGCAGGAAGCTC 


rs1 32695 


ACGTTGGATGGCCTCTCTCTATGACTACAC 


AC GTTG G ATGTTC AC AG C AG GG AACTCTTC 


rs1 966266 


ACGTTGGATGTGATTGTACAAGGCAGACCC 


ACGTTGGATGTGTAAGCACCTGCATTCAGC 


rs 1966267 


AC GTTG G ATGTAACTC AC AG ACC ATG AG GG 


ACGTTGGATGGGAGGAAAGCACAGCAGAAT 


rs 106808 


ACGTTGGATGAACAAGGCAGATCCTTCCCG 


ACGTTGGATGATGGTTCCTGAAGAGCAGTG 


rs1 32696 


ACGTTGGATGATGGTTCCTGAAGAGCAGTG 


ACGTTGGATGAACAAGGCAGATCCTTCCCG 


rs2239829 


ACGTTGGATGTGTCTTTGTCGTTCGGATGG 


ACGTTGGATGAAAGAGCGAAACTCCGTCTC 


rs2285154 


ACGTTGGATGTGAACTCAAATGATCCGCCC 


ACGTTGGATGAAGAACCCTTTTCGACTGGG 


rs2239830 


ACGTTGGATGAAACCCTAATGGGAAGCCTC 


ACGTTGGATGTGTGGTAGCAAGCAGTTGAC 


rs2239831 


ACGTTGGATGAGAACAGTCACTGACCCAAG 


ACGTTGGATGGCTCCACACACTTTGATTCC 


rs3865722 


ACGTTGGATGCCACTGTACTGCTAGTATTG 


ACGI IGGAIGACCIGCICIAGI I I ICAICC 


rs3865723 


ACGTTGGATGCCTGCATTTGATGCAATTCC 


AC GTTG G ATG GTTTCTGTTTCTGTTGCTTGC 


rs3985996 


ACGTTGGATGACATGGGTGACCCTATCAAG 


ACGTTGGATGTGATTCTAGGAGCAGGACTG 


rs3985997 


ACGTTGGATGCAAGAATTTCTCCCGGCATC 


ACGTTGGATGTGATTCTAGGAGCAGGACTG 


rs3985998 


ACGTTGGATGTGATTCTAGGAGCAGGACTG 


ACGTTGGATGCAAGAATTTCTCCCGGCATC 


rs3985999 


ACGTTGGATGTGATTCTAGGAGCAGGACTG 


ACGTTGGATGCAAGAATTTCTCCCGGCATC 


rs3986000 


ACGTTGGATGCAGG C AG AG G ATTTGTTTG G 


ACGTTGGATGCTTCTGTCCTTCTCTGTGTC 


rs2413382 


ACGTTGGATGAAACAAATCCTCTGCCTGGG 


ACGTTGGATGAAAAGCCCAGAGCCTTCATG 


rs2413383 


ACGI I GGA I GC I GGGC I I I I G I GC I AAG AG 


ACGTTGGATGGGGCCAAGTTGACCCATAAA 


rs2413384 


ACGI I GGA IGCI GGGC I I I IGIGCIAAGAG 


ACGTTGGATGGGGCCAAGTTGACCCATAAA 
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rs# 


Forward 
PCR primer 


Reverse 
PCR primer 




A O OTTOO A TO A ATO OTOTTOO fT^ATATAP 
MUu 1 1 OOA 1 UnM IUOIUI 1 bob 1 On 1 AV/A^ 


Ar^TT^nAT^ A TOO A A^rmi^T^ 1 1 1 OATO 
Abb 1 1 OOM 1 OA 1 bbAAbbbbb 1 O 1 1 1 OA 1 O 


iSZH I OOOO 


apottooato mm atoootpaaptto.opp 


APOTTOO ATOTTPP A A A ATP.OTPTTPOPTO 
Abb 1 1 OOA 1 O 1 1 bbAAAA 1 OO Ibl 1 bob 1 O 


rsi oytouo 


APOTTOOATO 1 1 1 1 ATOOOTPA APTTOOPP 
ML»U 1 1 uUn 1 O 1 1 1 Inl OOO 1 UHMLi 1 1 OOOO 


APOTTOO ATO AOAPTPPTOPA A A AOPTTPP 
AbO 1 1 OOA 1 OAOAb 1 bb 1 ObAAAAOb 1 1 bb 




APfiTT^f^AT^^r^AT^Ar^^f^TA 1 1 1 OPTOTP 
AbO 1 1 OOM I OOOA 1 OAOOO 1 M 1 1 1 O 1 U 


A OOTTOO ATOOOOOTOT ATOT A OOTTO A A O 
AbO 1 1 OOA 1 OOObb 1 O 1 A 1 O 1 AOO 1 1 OAAO 


rsy i ooo r 


A rPTTr.^AT^ AO ATP A O A A^f^^PrTnTATn 
AbO 1 1 oon 1 onuft 1 UnonnuouUu tolnlu 


AOOTTOO ATO A 1 1 1 OPTOTPTOPPTPTPTP 
AbO 1 1 OOA 1 OA 1 1 1 Ob 1 O 1 b 1 OOb lolblb 


rsyioootj 


APPTTrtftATPATTT^PTfiTPTPr^T/lT^TP 
AUo 1 1 OOA 1 OA 1 1 1 oU lolUI obL 1 O 1 b 1 b 


AOOTTOO ATO AO ATO AO A AOOOOOTOTATO 
AbO 1 1 OOA 1 OAOA 1 bAOAAOOObb lolMlb 


«•< QOCQ7 


AOPTm^AT^rAT^APP flAPAPA AO TO A OP 

Abo 1 1 OOA 1 OOA 1 OAOOAAOAOAAO 1 OAbo 


APOTTOO ATO AP APTO APTOATO APTOAOP 
AbO 1 1 OOA 1 OAbAb 1 OAb 1 OA 1 OAb 1 OAOb 


rs12781 


ACGTTGGATGTTACACACAGGGCACTCAGC 


ACGTTGGATGGAGCCAGAAAATTAACTGAAAA 
GC 


rs1 053983 


ACGTTGGATGCCATCATCAAGAAGCCACTG 


ACGTTGGATGTGTCTTACCAGCATCCACTC 


rs1 053982 


ACGTTGGATGGGAGAGTGGATGCTGGTAAG 


ACGTTGGATGCGGTTGAATGTCCTTCCAAG 


rs2227167 


ACGTTGGATGCTCCTGAGTGTATGGACATC 


ACGTTGGATGGGCATTAAGGGACATTCTGC 


rs2227168 


ACGTTGGATGCCAATTGGAGGCATTAAGGG 


ACGTTGGATGCTCCTGAGTGTATGGACATC 


rs1 32700 


ACGTTGGATGAATGTGGTGTCTGGCTCCAC 


ACGTTGGATGCTCAGCCCTGCTGTAAATGG 


rs3075364 


ACGTTGGATGGAACAGCAGTTTAGGGAGTG 


ACGTTGGATGCGAAGCCTTTCTATGGACTC 


rs2227169 


ACGTTGGATGAGGTAAGTAAGCTGCCTTTC 


ACGTTGGATGTTCAGAGCTTCATAGAGAGC 


rs2097466 


ACGTTGGATGCTGGGATTACAAGCATGAGC 


AC GTTGG ATG CTG C ATAAATC AC AG AG CTG 


rs2097467 


ACGTTGGATGATCTCCTGACCTTGTGATCC 


ACGI IGGAIGAI IC1 1 1 1 CAAGGCGGGGCG 


rs24 13387 


ACGTTGGATGAAGTAGCTGGGACTACAGGC 


ACGTTGGATGTAACACGGTGAAACCCCGTC 


rs 132701 


ACGTTGGATGGTGGCATATCTATGTTGTAC 


AC GTTG G ATG GCGAGACTC C ATCTC AAAAA 


rs1 32702 


ACGTTGGATGGAAGCTCACCCAGTTAAGGA 


ACGTTGGATGCCCCTGTAACAACAATCCTG 


rs 132703 


ACGTTGGATGCTTGACCTGATCAATGTGTG 


ACGTTGGATGTTTGTGCAGTTCCTCAGAAG 


rs2269595 


ACGTTGG ATGAG AAGTTCAGG AAAAGG GCC 


ACGTTGGATGCAGCAGGACTTTCTTTGGGA 


rs2269596 


ACGTTGGATGAGGTGCTCAGTTAGCGTTAC 


ACGTTGGATGTCCCAAAGAAAGTCCTGCTG 


rs1 32704 


ACGTTGGATGATATTCTTCCTGCACTGCTG 


ACGI IGGAIGAICICCCCGGGCIAGI 1 1 IC 


rs2007468 


ACGTTGGATGAGGTTACCTGGGCAATTCAG 


ACGTTGGATGGAAAATCCTGCTGACTAGCG 


rs1 32705 


ACG 1 1 GGA 1 G 1 1 1 1 GA 1 GG AGGCACCAG 1 G 


ACGTTGGATGTCTCCAAATACGGTCACTGG 


rs2007706 


ACGTTGGATGCCCAGGAATTTACATAAGGG 


ACGTTGGATGTTGAACATAGCAAGAGTGAG 


rs1 32706 


ACGTTGGATGAAGGATCAGTGCTGAGGGTC 


ACGTTGGATGATTCCTCCTGCTGGTCATGG 


rs1 32707 


ACGTTGGATGAATCCTTAGGAAGGGCTGGG 


ACGTTGGATGAGCTGGCCCCGTTAGTAAAC 


rs1 32708 


ACGTTGGATGTCTTGTTTCAGAGGGAGAGC 


ACGTTGGATGTCTCAGCCAATCCCAGAATC 


rs1 32709 


ACGTTGGATGTGAGTCCTGTCCAAGATGAG 


ACGTTGGATGAGCCCTTCCTAAGGATTCTG 


rs1 32710 


ACGTTGGATGTGAGTCCTGTCCAAGATGAG 


ACGTTGGATGCCTAAGGATTCTGGGATTGG 


rs1 32711 


ACGTTGGATGTCATCTTGGACAGGACTCAG 


ACGTTGGATGTTGCCATGGCAACCAAGTCA 


rs132712 


ACGTTG G ATG GTCTTC AAG G CTG AGTG AGC 


ACGTTGGATGACTCCACGTGGCCTCTCTTG 


rs132713 


ACGTTGGATGAAGGCTGAGTGAGCCCCAAC 


ACGTTGGATGACTCCACGTGGCCTCTCTT 


rs132714 


ACGTTGGATGACACGGTGAAACCCCTTCTC 


ACGTTGGATGAGTAGCTGGGACTACAGGTG 


rs132716 


ACGTTGGATGTGGATTTGCAATGAGGAGTC 


ACGTTGGATGTCAATGACTGTGCTCTACTC 


rs132717 


ACGI 1 GGA 1 GAA 1 G 1 GGGCAG 1 1 1 IACGIG 


ACGTTGGATGGATGGACCTTAGGGTGTTTC 


rs132718 


ACGTTGGATGTCAGAGGGTATCAACATCTC 


ACGTTGGATGTGGGCATCTTCATATACTGC 


rs132719 


ACGTTGGATGATACCCTCAGTTGTACCCAG 


ACGTTGGATGCTGAACAAAGGAGAAGGAGG 


rs1 32720 


ACGTTGGATGATACTGGGTACAACTGAGGG 


ACGI IGGAIGCCICICCACCI 1 1 ICCIAAC 


rs1 32721 


ACGTTGGATGCAGCTACAAAGTTGCTAATGG 


ACGTTGGATGTCTTATTTGTACCCTCCCTC 


rs 132722 


ACGTTGGATGACAATGGTAATGCTTGGAGC 


ACGTTGGATGGGGAGGGTACAAATAAGATG 


rs1 32723 


ACGTTGGATGACACAGATGTCTGTCTTCTG 


ACGTTGGATGATACTCCCCTGGTGAATGCT 


rs1 32724 


ACGTTGGATGCCCCATGCAACAAGGGTAAA 


ACGTTGGATGTGTCCCTTACAGCAAGAAGC 


rs1 32725 


ACGTTGGATGCCCCATGCAACAAGGGTAAA 


ACGTTGGATGTGTCCCTTACAGCAAGAAGC 
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dbSNP 
rs# 


Forward 
PCR primer 


Reverse 
PCR primer 


rs1 32726 


ACGTTGGATGGGGTCACACAGTGAACAAAG 


ACGTTGGATGTTCTTGCTGTAAGGGACAGG 


rs1 32727 


ACGTTGGATGGGGTCACACAGTGAACAAAG 


ACGTTGGATGTTCTTGCTGTAAGGGACAGG 


rs1 32728 


ACGTTGG ATG AGTTCTACTG GCTCATGGTG 


ACGTTGGATGTTCGCCTTCTTCCTGCTTTG 


rs1 32729 


ACGTTGGATGAGTTCTACTGGCTCATGGTG 


ACGTTGGATGTTCGCCTTCTTCCTGCTTTG 


rs1 32730 


ACGTTGGATGTTCGCCTTCTTCCTGCTTTG 


ACGTTGGATGAGTTCTACTGGCTCATGGTG 


rs1 32731 


ACGTTGGATGTTTGTTCACTGTGTGACCCC 


ACGTTGGATGTCCTGGTCCTCCCAGTTCTA 


rs1 32732 


ACGTTGGATGGCCAAGGCAACCATCTCAAC 


ACGTTGGATGTGCAGCTCATCACAAGCGTC 


rs1 32733 


ACGTTGGATGGCAACCATCTCAACACCATG 


ACGTTGGATGTGCAGCTCATCACAAGCGTC 


rs1 40575 


ACGTTGGATGTGCAGCTCATCACAAGCGTC 


ACGTTGGATGCCATCTCAACACCATGAGCC 


rs1 32734 


ACGTTGGATGGGATACTGACTGTTAGCCTC 


ACGTTGG ATGCGGAATTGACCAACTGGTAG 


rs1 32735 


ACGTTGGATGGGATACTGACTGTTAGCCTC 


ACGTTGGATGCGGAATTGACCAACTGGTAG 


rs80587 


ACGTTGGATGTCTGAGCCAAGCTCACCAGA 


ACGTTGGATGTTTTCCTGCCCAAGGAGGAG 


rs1 32736 


ACGTTGGATGTTTTCCTGCCCAAGGAGGAG 


ACGTTGGATGAAGCTCACCAGATGCAGACG 


rs1 32737 


ACGTTGGATGTCTAGGCAG C AATG AGCTAG 


ACGTTGGATGTGCTCCTCCTGAGAAATCAC 


rs1 32738 


ACGTTGGATGTGAAGCCTGTAATCCCAGTG 


ACGTTGGATGCATAGAGACAGCATCTCCTG 


rs1 807673 


ACGTTGGATGTTGCAGTGAGCAGAGATTGC 


ACGTTGGATGGTGAAATCTGAGTCGTGGTC 


rs2014700 


ACGTTGGATGTTGCAGTGAGCAGAGATTGC 


ACGTTGGATGGTGAAATCTGAGTCGTGGTC 


rs1 32739 


ACGTTGGATGGGAATCAAAGAAGGTGGAGG 


ACGTTGGATGTGGTTGTGGCCAGACCATAA 


rs1812023 


ACGTTGGATGAGCAGGAGGGAGGGAGCAAT 


ACGTTGGATGGACCTCCCTCCATCTCCTTA 


rs1 812024 


ACGTTGGATGTGTCAGAGGAAGATCCCTTG 


ACGTTGGATGCCTCATAGAGCTATTGCGAG 


rs2005590 


ACGI rGGATGGGAGGCAATGCCTGAl IMG 


ACGTTGGATGTGCTTCCACCACCTGGAAAA 


rs1 32740 


ACGTTGGATGTTCATTCTTCTTGTGCACAG 


ACGTTGGATGCTTGACTGGTACCTAACAATG 


rs3986001 


ACGI IGGAIGI I I ICIGACI IGUCAICACC 


ACGTTGGATGCACAAAGTATTCCACCTTCC 


rs24 13390 


ACGTTGGATGATAAATTCGTGGCTGAGCTC 


ACGTTGGATGATCTTGTGGCATAAGGAGTC 


rs1 32743 


ACGTTGGATGACAGGGAGAAAGTGAGGAAG 


ACGTTGGATGCATCCTGTTTCCCCTAAAGG 


rs1 32744 


ACGTTGGATGGGGTCTGTTTCAGGAGCATG 


ACGTTGGATGTCTATGGCTGATGGCCACAG 


rs24 13391 


ACGTTGGATGGGGCAAAAGCAGAAATACTG 


ACG I I GGA I GCCC I CAAACCC I G I I I I C I G 


rs1 32749 


ACGTTGGATGCCATGCACTCTCTAGTACTC 


ACGTTGGATGTGTGGCCTTGGGGAAATGAT 


rs1 32750 


ACGTTGGATGTCCTGTGCCTGTGGAAACTC 


ACGTTGGATGGGTTCTCCAGGTGGCAAAAG 


rs1 32741 


ACGTTGGATGCTACAATTTATCCGCACTAG 


ACGTTGGATGGCCAAGTCAGAAAAATGAGAG 


rs2413388 


ACGTTGGATGTACAGAATTCAGACCAACCC 


ACGI 1 GGA 1 GGCCC 1 GAGA 1 1 IGAI 1 1 ICC 


rs2413389 


ACGTTGGATGGCTAGAATCTCATAACAGACG 


ACGTTGGATGGCGTCCTACTATGATTTGTC 



TABLE 12 



dbSNP 


Extend 


Term 


rs# 


Primer 


Mix 


rs3888818 


TGAGACCAGCCTAGCCAAC 


ACT 


rs20 10605 


TGGTCCCAAGATATTCTATAGA 


ACT 


rs743919 


CTCACCTAAGGACTGCCTCT 


ACT 


rs1008134 


CCGGGCAICI I I I C I I CCA I C 


ACT 


rs 132607 


GCAGGCAGGACAGCATGTG 


ACT 


rs 1476029 


CTTAG AGGCT ATATTAAG AC C A 


ACG 


rs1 476030 


TTTGGCAATACTGGCCTATTC 


ACT 


rs2413380 


TCCAGGAGGGAAGACAACC 


ACT 


rs2051609 


CGCTTGAGGTCAGGACCAG 


CGT 
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dbSNP 


Extend 


Term 




Primer 


Mix 


rs24 13381 


GCGTGTTGCCAACAGCCTC 


ACT 


rs 1894604 


AAATGGGAGGGAATGTTGGC 


ACG 


rs 1894605 


TGCAGATCGCAACTGAGCG 


ACT 


rs1 32609 


GTTTCTCAGAGGATCAGGGA 


ACT 


rs1 32610 


GGTGTGAGATTTGGAGACTTT 


ACG 


rs1 32611 


CTCTGTCCTCTAGCCCCC 


ACT 


rs132612 


ATCTTCCCGCTACCTCAAGAGT 


ACT 


rs1 008790 


CATAATCACAAGTCCTATGATTA 


ACT 


rs23085 


AGGATGACCATGGCAAGGAA 


CGT 


rs105161 


GGCCCTGGCAGGAAACAG 


ACG 


rs132613 


CCAATG GCCTCC ACTG GC 


ACT 


rs132614 


CGGCCACAGCGCTGCCC 


ACG 


rs132615 


GCTTTC AGAAC AACG GTAGAA 


ACT 


rs132617 


AAGAGTGTGTGTGCAGTAGCAAG 


ACG 


rs3865724 


TCACTTAAGCTTTGAATGTTTCTG 


ACT 


rs20 19657 


G CC AGAAC ATTGTGTTTCATTTGT 


ACG 


rs3865725 


GGCAAGAGATACAGAATGCACA 


CGT 


rs20 19364 


GTATCTCTTGCCCCTGCTC 


ACG 


rs2008383 


GAAGGACAGAAGGCTGATGC 


ACG 


rs3986002 


TCCTTCTCTGTGTCACTCCT 


CGT 


rs3888942 


ACATGGAAGCAGGGGTTTGA 


CGT 


rs3888943 


AGCAGGGGTTTGATGAAATCT 


ACT 


rs3888944 


CATCCTCCACATTGGGCCAA 


ACT 


rs132618 


AATCTCAGCTGGAAGTGG 


CGT 


rs132619 


TG C AACC AG C ATTG AC C G 


CGT 


rs3827346 


GAGGCTGCACCATCTCCAA 


ACG 


rs1 32620 


GGAAGCCTTTATTCAGGATTGT 


ACT 


rs1 32621 


TCAAATCTGCAACTGGTGTCAGAA 


ACT 


rs80575 


GAACTCTCAAGCCACTTGAC 


ACT 


rs80576 


GCTAAGGCATTAGTTTGGCTGG 


ACG 


rs80577 


TGAAATTGCACATGGCATTGG 


ACG 


rs80578 


ATGCCTGGGAACTGGGGC 


ACT 


rs80579 


GGGAGGCACTGAGGGCATGAAA 


ACT 


rs80580 


CTGAGAATGAACAGCAGGTCA 


ACT 


rs1 32622 


ATAGTAGTTCAATCAGATGGGC 


ACT 


rs1 32623 


ATTCCAGCCTCTCTGTGTTCTG 


ACT 


rs1 32624 


ACAGGCATGAGCCGCTGC 


ACT 


rs1 32625 


GGCCACCGCATCCGGCTA 


CGT 


rs1 32626 


AGTGG CATG ATCTC G G CC C AC 


ACG 


rs1 32627 


GAGGCGGAGGTTGCGGTG 


ACT 


rs1 807672 


CATTGAGAATAAGGTGGTTCGA 


ACT 


rs1 32628 


CCTCCAAATTCATATACTGAGACC 


ACG 


rs1 32629 


TCTCTCTCTCTCACACAC 


CGT ! 


rs1 32630 


CTGCTCCTGGCTTACAGAG 


ACT 
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dbSNP 


Extend 
Prim6i* 


Term 
Mix 


rs1 32631 


TCACAGTTCTGGAGGCAAAAA 


ACT 


rs1 32632 


I GGAA Mill GCC I CCAGAAC I G I 


ACT 


rs1 32633 


AGTGATTCACCAGGGAAGTGCCA 


ACG 


rs 132634 


GGTGCTTTGTGGAGGAACC 


ACT 


rs1 32635 


ATTTCCCGTACATGGGGAGAAA 


ACT 


rs 132636 


TGACAATAGGCACATGGCAG 


ACT 


rs1 29603 


G CTG CC ATCCTAAAC AC ATCTA 


ACT 


rs1 32637 


ACACAGCAGGATTACTGCCCAGA 


ACT 


rs3788518 


TGGGAGGCTCAAGGAAGAAACTCT 


ACT 


rs 132638 


GGAAAAAATAAAAGCAAAATACCC 


ACT 


rs1 32639 


AAAGCAAACAGGCCTTCAGAA 


CGT 


rs1 32640 


GGTGACACAGAGAAGACGTGGC 


ACT 


rs1 32641 


GGGAGGTCAGAGGTCGG G 


ACT 


rs1 32642 


GTCACTGAGAGACTTTCC 


CGT 


rs1 32643 


GACACCCAGTACACACTGGCT 


ACT 


rs1 32644 


TTTGGAATGAGGAGTCATTTACA 


ACT 


rs1 32645 


CTC AAC A GTAAGC AAG ATTTAAA 


ACT 


rs2017329 


TGATGTTCAGAI I I ICCI I I I I II 


CGT 


rs739198 


TGGTCTCCACAACCTCTTATC 


ACT 


rs1 32647 


AAACCATGGAAGTCTCTAGAGTCA 


ACT 


rs2097465 


CAAACTGCAGGCTTGCCCAG 


ACT 


rs2105915 


CAAGCCTGGGCAGCATAGCAC 


ACT 


rs1 32648 


AATTCCCGTGCTAATGCACG 


ACT 


rs1 32649 


u GAG I C I I A I I AC I I I I G I ACGAGG 


ACT 


rs1 32650 


CATGAGCCACCGTGCCAG 


ACT 


rs1 32651 


CC I CAGGG 1 I I I I CACC 1 GCC 1 


ACT 


rs1 32652 


AGGGCATCCTAACCCCCTA 


CGT 


rs80584 


ATCTACCTGCTCAACTTCCTGA 


ACG 


rs1 32653 


AATAACCAGACACGTTCTCCAG 


ACT 


rs916334 


GTCCAGCAGCACCCTTGGT 


ACG 


rs1 32654 


CGACAAGAGCAGGTCTGGAAC 


ACT 


rs1 32655 


CAGAAGAACCCACATAAGGAA 


ACT 


rs1 32656 


ICI 1 IGICI 1 1 1 AC 1 CCCACA 1 CC 


ACT 


rs3834684 


CACTAGAAGCAAGGAACCCCC 


ACT 


rs1 32657 


TACCTGACAATCACCCCC C 


CGT 


rs916335 


TCAGGTAATTGTCAGTCAGCC 


ACT 


rs 132659 


AGAACTCCCCAAATCGTCCT 


ACG 


rs1 32660 


CCCCAGAG 1 GGGC 1 1 1 ICI 


ACT 


rs1 32661 


CCGCTCTCCCTCTGAGAGT 


ACT 


rs1 32662 


TGATCTGAGTTTACAGGTGAG 


ACT 


rs1 32663 


TGAGATCTGTCTCAGACGCA 


CGT 


rs1 32664 


CTGGGCTAGAGAGGGAGAC 


ACT 


rs1 32667 


C 1 1 1 AAC 1 1 1 1 GC 1 CACAAGAG 1 


ACT 


rs1 32670 


AGACCAGCCTGATCAACATG 


ACT 
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dbSNP 


Extend 


Term 


rs# 


Primer 


Mix 


rs1 32671 


ACATCAATAGGCCTAAAAATCGTT 


ACT 


rs 132672 


GAAACTTGAAATTCCTTGAGAAAT 


ACT 


rs1 32673 


GTGTGAGTGGGAAGCCTCC 


ACG 


rs1 32674 


GGAGTTGGAGGCTGTAGTAA 


ACT 


rs1 32675 


AGATGTGCAATGG AATTTG GCAA 


ACT 


rs80585 


AGGCATAATATTCTGACCATTAAG 


ACG 


rs80586 


GGTCTGCTACTAGAATTCAGAA 


ACG 


rs1 32676 


ATCCCTTAATATTGCATAGGAC 


CGT 


rs1 32677 


GGGTTGAAGTACTATGCTAGT 


CGT 


rs1 32678 


AGGTTAGTTCATGTAACTCCAT 


ACT 


rs1 32680 


I I I IAI I 1 IAGCI IGAGCI 1 1 ICA 


ACG 


rs1 32681 


CTAGCTCTAAATCACATTCTGC 


ACG 


rs1 32683 


CAGGCCCATACCCAAAATATGCT 


ACT 


rs2269594 


TGGCTAAAAGGACAGATAGAG 


ACT 


rs1 32684 


GACACTAAGAGCGGTGAGAC 


ACT 


rs1 32685 


CGTGCCACCCAACTGGAGA 


ACT 


rs 132686 


TGTGCATCTTATGGTGTACCA 


ACG 


rs1 32687 


TCGTTACCCCCATTCTATCC 


ACT 


rs1 32688 


ACAGGACGTGCTTGAAAGAG 


ACG 


rs1 32689 


TGGCGATGGCCTCTGCTC 


ACT 


rs1 32690 


GCTCTCCTTGCTTCAAAAAAAAA 


CGT 


rs1 32691 


TGACCTATCCTGCTTCAGGT 


ACT 


rs1 32692 


CGAAGTGTGTTAGCTCATGAC 


ACT 


rs1 32693 


G AGTGTGG AC ACC AG GTCA 


ACT 


rs 132694 


CACCTTAGGAATGGCAGCTTC 


ACG 


rs1 32695 


TATGACTACACATGCTGGCAAAC 


ACT 


rs1 966266 


GCAGACCCCTAACTCTAATTTG 


ACG 


rs1 966267 


CACTGAGTTATGAGTACTCAAC 


ACT 


rs1 06808 


GATCCTTCCCGAGGACACC 


ACT 


rs1 32696 


CAGGCTGCCTGGAAGGAGA 


ACT 


rs2239829 


GATGGCTGGATTCATAACAGGTAA 


ACT 


rs2285154 


AGTGCTGGGATTACAGGCAT 


ACT 


rs2239830 


CAGTCACCTGAATTTGTGCTTATT 


ACT 


rs2239831 


GTCACTGACCCAAGCTATCCTC 


ACG 


rs3865722 


CAATTGCAAGCAACAGAAACAGAA 


CGT 


rs3865723 


TCCACTGTACTGCTAGTATTG 


ACG 


rs3985996 


AGAATTTCTCCCGGCATCAG 


ACG 


rs3985997 


TCTCCCGGCATCAGCCTTC 


ACG 


rs3985998 


GGGAGGAGTGACACAGAGAAGGA 


ACG 


rs3985999 


GGGGTACTGGGAGGAGTGACAC 


ACT 


rs3986000 


GCAGGACTGGGGTACTGG 


ACT 


rs2413382 


CAGGGGAACTCAGGCCACA 


ACT 


rs2413383 


AG CC ATTG AAG AC ATG G AAG CC 


ACT 


rs2413384 


ATTGAAGACATGGAAGCCGG 


ACT 
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dbSNP 


Extend 


Term 


rarr 






rs24 13385 


GGCCAGCTCTTCCTCCAC 


CGT 


rs24 13386 


CTTGGCCCAATGTGGAGGA 


CGT 


rs 1894606 


CCTAGCGGCAAGGGCTGT 


ACT 


rs916336 


AGGGTATTTGCTGTCTGGCTGTCT 


ACT 


rs916337 


AAGGGCCTGTATGTAGGTTGAA 


ACT 


rs916338 


CCCTCTATGTCTCATGGATTTTCC 


ACG 


rs1 32697 


CCTAGGGGAGCCCATATATCA 


ACT 


rs12781 


AGACAGCTCGAGAGATCC 


ACT 


rs1 053983 


GCTGACTCAGATACACCCC 


ACG 


rs1 053982 


GATGCTGGTAAGACAGGG 


ACG 


rs2227167 


CGTCAAAATCAAGTGCAAA 


ACT 


rs2227168 


CATTAAGGGACATTCTGC 


ACT 


rs1 32700 


ATCCTGTCTGTCATTGGCGTT 


ACT 


rs3075364 


G I I I AGGGAG I GG I I I I I GAAAG 


CGT 


rs2227169 


TGTCCTTTATTGGTACAGGGAAGA 


ACT 


rs2097466 


TACAAGCATGAGCCACCGC 


ACT 


rs2097467 


TCCCAAAGTGCTGGGATTACA 


ACT 


rs24 13387 


TACAGGCACTCACCACCAC 


ACT 


rs1 32701 


TGTACAAAACATATTTAACCTTGA 


ACT 


rs1 32702 


CACCCAGTTAAGGAAAAATTCCT 


ACT 


rs1 32703 


GATCAATGTGTGTTCCCGGA 


ACT 


rs2269595 


GCCCCAGACAGCATCTCC 


ACT 


rs2269596 


TTGCTGGCAAGAGACCAGG 


ACT 


rs 132704 


GGGCTGCCTGGAGGAGG 


ACG 


rs2007468 


TGGGCAATTCAGCCACACGCAC 


ACT 


rs1 32705 


TATAGACTGAATTGTGTGCCC 


ACG 


rs2007706 


GGAATTTACATAAGGGTCTATAG 


ACT 


rs1 32706 


GAACCCCCTCCACTGCCC 


ACT 


rs1 32707 


GCTCTCCCTCTGAAACAAGATG 


ACT 


rs1 32708 


GAGAGCTTCTTCCTTGGCC 


ACT 


rs1 32709 


C AG G G AAG ATTAG AAG CTGAGAGC 


CGT 


rs1 32710 


GGGCAGGGAAGATTAGAAGC 


ACG 


rs1 32711 


TTTGCTGTCCAGGGCGGC 


ACG 


rs132712 


AACCCAGACGGAGGTGGC 


ACG 


rs132713 


GCCCCAACGGAACCCAGA 


ACG 


rs132714 


CCCCTTCTCTACTGAAAATACAAA 


ACT 


rs132716 


TGAGGAGTCATTTACCATGAG 


ACG 


rs132717 


GCAGTTTTACGTGAAGGAGG 


ACT 


rs132718 


GTTTTATACCTAGAGCCACACT 


ACT 


rs132719 


TACCCAGTATTTCTTAACTTCC 


CGT 


rs1 32720 


CAGCTACAAAGTTGCTAATGG 


ACT 


rs1 32721 


ATGTTAGGAAAAGGTGGAGAG 


ACT 


rs1 32722 


CCTAACTGGGATGGGCCTGAA 


ACT 


rs1 32723 


TTCTGGGGCCCCCATGCA 


ACT 
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dbSNP 

rs# 


Extend 


Term 
Mix 


rs1 32724 


ACCCAGTCCTGGGCAGCA 


ACT 


rs1 32725 


GGGAGTATGCAGAGGGGC 


ACT 


rs1 32726 


CAGTGAACAAAGCAGGAAGAAGG 


ACT 


rs1 32727 


GTCACACAGTGAACAAAGCAGGAA 


ACT 


rs1 32728 


CATGGTGTTGAGATGGTTGCC 


ACG 


rs1 32729 


GCTCATGGTGTTGAGATGGTT 


ACT 


rs1 32730 


GTGACCCCCTAGGCCAAGGCA 


ACT 


rs1 32731 


GGCAACCATCTCAACACCAT 


ACT 


rs1 32732 


AACCATCTCAACACCATGAGCCA 


ACT 


rs 132733 


ATCTCAACACCATGAGCCAG 


ACT 


rs1 40575 


GCTCTCTCCTGGTCCTCC 


ACG 


rs1 32734 


AGCCTCAACTAGGACACA 


ACT 


rs 132735 


TCAACTAGGACACAGTGC 


ACT 


rs80587 


GCCAAGCTCACCAGATGCAGA 


ACT 


rs1 32736 


AGGGAGCTGCTTTGCTGAAA 


ACT 


rs1 32737 


CCTGCAGCCTGGGTGACA 


ACT 


rs1 32738 


GGCCAGGAGTTCAAGACAGCCTG 


ACT 


rs1 807673 


GGGCAACAGAGCGAGACTCC 


ACT 


rs20 14700 


GAGCGAGACTCCATCTCA 


ACT 


rs1 32739 


GGGAGGTGACCTGGAGCC 


ACG 


rs1812023 


GCAATCAGACTCAAGCCTGG 


ACT 


rs1812024 


GGGATGGTGTGACCTCCC 


ACG 


rs2005590 


CAAIGCCIGAI I I IGICACIGAAC 


ACT 


rs1 32740 


GGCATATGTGCATTTGTCTGAG 


ACG 


rs3986001 


TCCI I I I ICIAAACCCCTGCAA 


ACT 


rs24 13390 


GGCTGAGCTCAAGGTTTTAAA 


ACT 


rs1 32743 


GGAGAAAGTGAGGAAGAAAATTA 


ACT 


rs1 32744 


TGGGGTTACAGTTGGTCATAACC 


ACT 


rs2413391 


TGATATGTTCAGCGGTGCAC 


ACT 


rs 132749 


TCTTGATGTTTCTCCTATCCC 


ACG 


rs1 32750 


CCTGTGGAAACTCAGCAGC 


ACG 


rs1 32741 


G C ACT AG ATATTG AATTCTTTC C 


ACT 


rs24 13388 


CAACCCCGTGACTGGAGATTC 


CGT 


rs24 13389 


TTTCTCTCTCTAGTACTCTATTT 


ACT 



Genetic Analysis 

[0229] AJlelotyping results from the discovery cohort are shown for cases and controls in Table 13. 
The allele frequency for the A2 allele is noted in the fifth and sixth columns for osteoarthritis case pools 
and control pools, respectively, where "AF" is allele frequency. The allele frequency for the Al allele 
can be easily calculated by subtracting the A2 allele frequency from 1 (Al AF = 1-A2 AF). For example, 
the SNP rs2010605 has the following case and control allele frequencies: case Al (A) = 0.19; case A2 
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0.81; control Al (A) = 0.18; and control A2 (G) = 0.82, where the nucleotide is provided in 
thesis. Some SNPs are labeled "untyped" because of failed assays. 



TABLE 13 



dbSNP 
rs# 


Position in 
SEQ ID NO: 
1 


Chromosome 
Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


Fp- 
Value 


rs3888818 


201 


34781551 


CAT 








rs20 10605 


425 


34781775 


A/G 


0.81 


0.82 


0.782 


rs743919 


1095 


34782445 


G/T 


0.10 


0.11 


0.502 


rs1008134 


2201 


34783551 


A/C 








rs1 32607 


7879 


34789229 


A/G 


0.11 


0.11 


0.813 


rs 1476029 


8395 


34789745 


CAT 


0.15 


0.15 


0.983 


rs1 476030 


8461 


3478981 1 


CAT 


0.36 


0.37 


0.708 


rs24 13380 


9503 


34790853 


CAT 


0.29 


0.29 


0.900 


rs2051609 


10304 


34791654 


G/T 








rs24 13381 


10695 


34792045 


CAT 








rs 1894604 


16300 


34797650 


A/G 


0.08 


0.08 


0.759 


rs 1894605 


16444 


34797794 


GAT 


0.08 


0.09 


0.468 


rs1 32609 


17591 


34798941 


CAT 


0.68 


0.67 


0.777 


rs1 32610 


17988 


34799338 


-/A 








rs1 32611 


19116 


34800466 


-AT 


0.14 


0.15 


0.863 


rs132612 


19358 


34800708 


CAT 


0.23 


0.23 


0.951 


rs1 008790 


20300 


34801650 


A/G 


0.03 


0.10 


0.007 


rs23085 


20669 


L 34802019 


AAT 


0.31 


0.32 


0.738 


rs105161 


20891 


34802241 


A/G 


0.76 


0.77 


0.826 


rs132613 


21451 


34802801 


CAT 


0.80 


0.81 


0.619 


rs 132614 


21978 


34803328 


CAT 


0 16 


0.14 


0.434 


rs132615 


22785 


34804135 


C/G 


0.32 


0.31 


0.740 


rs132617 


24248 


34805598 


CAT 


0.35 


0.36 


0.825 


rs3865724 


24770 


34806120 


CAT 


0.65 


0.65 


0.940 


rs20 19657 


24844 


34806194 


A/G 


0.20 


0.20 


0.857 


rs3865725 


25066 


34806416 


G/T 








rs20 19364 


25096 


34806446 


CAT 


0.40 


0.39 


t 0.767 


rs2008383 


25309 


34806659 


A/G 


0.18 


0.17 


0.665 


rs3986002 


25344 


34806694 


A/C 








rs3888942 


25529 


34806879 


AAT 








rs3888943 


25537 


34806887 


A/G 








rs3888944 


25554 


34806904 


A/C 








rs132618 


27963 


34809313 


AAT 


0.43 


0.43 


0.934 


rs132619 


28134 


34809484 


GAT 








rs3827346 


28356 


34809706 


A/G 


0.84 


0.84 


0.976 


rs1 32620 


29648 


34810998 


-/A 


0.29 


0.29 


0.879 


. rs1 32621 


29986 


34811336 


A/G 


0.32 


0.31 


0.867 


rs80575 


30217 


34811567 


GAT 


0.27 


0.27 


0.948 


rs80576 


30267 


34811617 


A/G 


0.26 


0.25 


0.443 


rs80577 


30315 


34811665 


A/G 


0.26 


0.23 


0.191 


rs80578 


30585 


34811935 


CAT 


0.49 


0.48 


0.548 


rs80579 


30724 


34812074 


A/C 


0.23 


0.25 


0.574 


rs80580 


30897 


34812247 


C/T 


0.31 


0.31 


0.878 


rs1 32622 


30931 


34812281 


CAT 


0.29 


0.30 


0.943 


rs1 32623 


31080 


34812430 


G/T 


0.60 


0.59 


0.806 


rs 132624 


31246 


34812596 


CAT 


0.36 


0.37 


0.772 


rs1 32625 


31373 


34812723 


AAT 








rs 132626 


31463 


34812813 


A/G 


0.89 


0.84 


0.082 


rs1 32627 


31467 


34812817 


A/G 


0.12 


0.11 


0.836 


rs 1807672 


32188 


34813538 


G/T 


0.30 


0.30 


0.974 


rs1 32628 


32288 


34813638 


CAT 


0.25 


0.24 


0.691 


rs 132629 


32520 


34813870 


AAT 


0.04 


0.06 


0.250 
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dbSNP 
rs# 


Position in 
SEQ ID NO: 
1 


Chrnmosomp 
Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


F D- 

Value 


rs1 32630 


32594 


34813944 


A/C 


0.75 


0.75 


0.978 


rs1 32631 


32657 


34814007 


A/C 


0.72 


0.73 


0.509 


rs1 32632 


32677 


34814027 


A/G 


0.66 


0.65 


0.798 


rs1 32633 


32764 


34814114 


C/T 


0.34 


0.33 


0.796 


rs1 32634 


32784 


34814134 


A/G 


0.45 


0.42 


0.317 


rs1 32635 


32830 


34814180 


C/T 


0.41 


0.40 


0.772 


rs1 32636 


32872 


34814222 


C/T 


0.41 


0.44 


0.192 


rs1 29603 


33121 


34814471 


A/C 








rs1 32637 


33348 


34814698 


G/T 


0.09 


0.09 


0.628 


rs3788518 


33952 


34815302 


C/G 


0.17 


0.19 


0.297 


rs1 32638 


34184 


34815534 


C/G 


0.56 


0.58 


0.509 


rs1 32639 


34361 


34815711 


A/T 


0.13 


0.12 


0.561 


rs1 32640 


35026 


34816376 


A/G 


0.32 


0.30 


0.388 


rs1 32641 


35192 


34816542 


A/G 


0.48 


0.51 


0.287 


rs1 32642 


35600 


34816950 


A/T 


0.15 


0.14 


0.732 


rs1 32643 


36033 


34817383 


C/T 


0.44 


0.46 


0.360 


rs1 32644 


36289 


34817639 


C/T 


0.53 


0.58 


0.075 


rs1 32645 


38869 


34820219 


A/G 


0.19 


0.18 


0.572 


rs2017329 


39629 


34820979 


A/T 


0.39 


0.40 


0.915 


rs739198 


40530 


34821880 


C/T 


0.70 


0.70 


0.878 


rs1 32647 


41621 


34822971 


C/T 


0.23 


0.23 


0.957 


rs2097465 


42379 


34823729 


C/T 


0.54 


0.51 


0.344 


rs2105915 


42802 


34824152 


C/T 


0.57 


0.56 


0.468 


rs1 32648 


42865 


34824215 


T/C 








rs1 32649 


43644 


34824994 


A/G 


0.21 


0.22 


0.579 


rs 132650 


45051 


34826401 


C/T 


0.34 


0.31 


0.248 


rs1 32651 


45828 


34827178 


A/C 








rs1 32652 


45829 


34827179 


A/T 








rs80584 


46257 


34827607 


C/T 


0.81 


0.75 


0.043 


rs 132653 


47286 


34828636 


A/C 


0.17 


0.15 


0.312 


rs916334 


47427 


34828777 


C/T 


0.34 


0.36 


0.345 


rs1 32654 


47963 


34829313 


C/T 


0.54 


0.56 


0.439 


rs1 32655 


48013 


34829363 


C/T 


0.41 


0.41 


0.838 


rs1 32656 


48229 


34829579 


C/T 


0.37 


0.36 


0.813 


rs3834684 


48282 


34829632 


-/A 


0.21 


0.22 


0.480 


rs1 32657 


48376 


34829726 


-/G 


0.49 


0.50 


0.719 


rs916335 


48404 


34829754 


A/G 


0.38 


0.36 


0.509 


rs1 32659 


49900 


34831250 


C/T 








rs 132660 


52699 


34834049 


G/T 


0.37 


0.38 


0.754 


rs 132661 


52897 


34834247 


A/G 


0.27 


0.29 


0.291 


rs1 32662 


53414 


34834764 


A/G 


0.61 


0.58 


0.186 


rs1 32663 


53487 


34834837 


A/T 


0.26 


0.29 


0.167 


rs1 32664 


54112 


34835462 


G/T 


0.25 


0.29 


0.098 


rs1 32667 


55492 


34836842 


A/G 


0.35 


0.36 


0.559 


rs 132670 


59766 


34841116 


C/T 








rs1 32671 


60307 


34841657 


A/G 


0.49 


0.53 


0.145 


rs 132672 


60701 


34842051 


A/G 


0.23 


0.22 


0.716 


rs 132673 


60952 


34842302 


A/G 


0.41 


0.37 


0.184 


rs1 32674 


61401 


34842751 


C/T 


0.32 


0.31 


0.476 


rs1 32675 


62379 


34843729 


C/T 


0.38 


0.35 


0.188 


rs80585 


62870 


34844220 


C/T 


0.28 


0.27 


0.525 


rs80586 


62879 


34844229 


A/G 


0.66 


0.66 


0.966 


rs1 32676 


63499 


34844849 


A/T 


0.26 


0.23 


0.177 


rs1 32677 


64284 


34845634 


-/A 


0.69 


0.69 


0.873 


rs1 32678 


64408 


34845758 


A/G 


0.46 


0.48 


0.395 


rs1 32680 


64760 


34846110 


A/G 


0.20 


0.20 


0.995 


rs1 32681 


65230 


34846580 


A/G 


0.24 


0.24 


0.901 
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rfhSNP 

UUiJll IT 


Position in 
SEQ ID NO: 
1 


th rnmncnmp 

V«11I U1IIU3U1IIC 

Position 


A1/A2 
Allele 


F A2 
Ca^p AF 


F A2 
Control AF 


F D- 

Vsliie 


rs1 32683 


66127 


34847477 


A/G 


0.19 


0.19 


0.851 


rs2269594 


66634 


34847984 


C/T 


0.70 


0.67 


0.332 


rs1 32684 


66686 


34848036 


A/G 


0.21 


0.20 


0.756 


rs1 32685 


66694 


34848044 


C/G 


0.30 


0.28 


0.553 


rs1 32686 


67113 


34848463 


A/G 


0.46 


0.48 


0.398 


rs1 32687 


67257 


34848607 


A/G 


0.96 


0.96 


0.767 


rs 132688 


67403 


34848753 


A/G 


0.24 


0.23 


0.553 


rs1 32689 


67609 


34848959 


A/G 


0.61 


0.63 


0.564 


rs1 32690 


68418 


34849768 


-/A 


0.16 


0.17 


0.672 


rs1 32691 


68610 


34849960 


C/G 


0.52 


0.52 


0.976 


rs1 32692 


69629 


34850979 


C/T 


0.63 


0.62 


0.800 


rs1 32693 


70024 


34851374 


A/G 


0.58 


0.58 


0.868 


rs1 32694 


70848 


34852198 


A/G 


0.17 


0.16 


0.583 


rs 132695 


71428 


34852778 


C/G 


0.23 


0.22 


0.616 


rs1 966266 


71553 


34852903 


C/T 


0.49 


0.47 


0.413 


rs1 966267 


71633 


34852983 


A/G 


0.40 


0.41 


0.773 


rs1 06808 


71768 


34853118 


A/C 


0.68 


0.67 


0.617 


rs1 32696 


71769 


34853119 


A/G 








rs2239829 


73039 


34854389 


A/G 


0.34 


0.36 


0.510 


rs2285154 


73325 


34854675 


A/G 








rs2239830 


73412 


34854762 


A/C 


0.49 


0.50 


0.841 


rs2239831 


73547 


34854897 


C/T 


0.52 


0.50 


0.564 


rs3865722 


73769 


34855119 


A/T 


0.57 


0.56 


0.861 


rs3865723 


73806 


34855156 


A/G 


0.59 


0.58 


0.722 


rs3985996 


74467 


34855817 


C/T 


0.30 


0.29 


0.861 


rs3985997 


74472 


34855822 


C/T 


0.89 


0.90 


0.527 


rs3985998 


74473 


34855823 


A/G 








rs3985999 


74482 


34855832 


C/T 


0.19 


0.19 


0.968 


rs3986000 


74494 


34855844 


AC 








rs24 13382 


74592 


34855942 


A/G 


0.61 


0.59 


[ 0.618 


rs24 13383 


74670 


34856020 


G/T 








rs24 13384 


74672 


34856022 


G/T 








rs24 13385 


74714 


34856064 


G/T 


0.70 


0.70 


0.944 


rs24 13386 


74723 


34856073 


A/T 


0.70 


0.71 


0.816 


rs1 894606 


74749 


34856099 


A/G 








rs9 16336 


74861 


34856211 


C/G 








rs9 16337 


74892 


34856242 


C/T 








rs9 16338 


74893 


34856243 


C/T 


0.40 


0.40 


0.939 


rs1 32697 


75176 


34856526 


A/G 


0.59 


0.61 


0.418 


rs12781 


75705 


34857055 


A/G 








rs1 053983 


75989 


34857339 


A/G 


0.42 


0.43 


0.848 


rs1 053982 


76027 


34857377 


A/G 


0.69 


0.69 


0.981 


rs2227167 


77949 


34859299 


A/G 


0.35 


0.37 


0.392 


rs2227168 


77974 


34859324 


C/T 


0.38 


0.38 


0.797 


rs1 32700 


78167 


34859517 


C/T 


0.25 


0.25 


0.729 


rs3075364 


78310 


34859660 


-/CT 


0.42 


0.42 


0.879 


rs2227169 


78415 


34859765 


C/T 


0.38 


0.37 


0.619 


rs2097466 


78575 


34859925 


C/T 


0.52 


0.52 


0.962 


rs2097467 


78590 


34859940 


C/T 








rs2413387 


78709 


34860059 


C/T 








rs 132701 


78875 


34860225 


C/T 


0.23 


0.23 


0.951 


rs1 32702 


79864 


34861214 


C/T 








rs1 32703 


81316 


34862666 


C/T 


0.52 


0.52 


0.995 


rs2269595 


81320 


34862670 


A/G 


0.19 


0.18 


0.464 


rs2269596 


81409 


34862759 


C/T 


0.40 


0.40 


0.753 


rs1 32704 


81737 


34863087 


C/T 


0.65 


0.66 


0.906 


rs2007468 


81843 


34863193 


A/G 


0.40 


0.40 


0.987 
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dbSNP 
rs# 


Position in 
SEQ ID NO: 
1 


f"h mm n^n m p 

V^III UII1U3U1I1C 

PositioD 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


F D- 

Value 


rs1 32705 


82102 


34863452 


C/T 


0.30 


0.30 


0.944 


rs2007706 


82833 


34864183 


C/T 


0.32 


0.32 


0.944 


rs1 32706 


83461 


34864811 


err 








rs1 32707 


83624 


34864974 


err 


0.26 


0.28 


0.540 


rs1 32708 


83660 


34865010 


C/G 


0.30 


0.29 


0.506 


rs1 32709 


83701 


34865051 


A/T 


0.45 


0.46 


0.538 


rs1 32710 


83708 


34865058 


A/G 


0.62 


0.60 


0.310 


rs1 32711 


83782 


34865132 


C/T 








rs132712 


85707 


34867057 


A/G 


0.84 


0.84 


0.693 


rs132713 


85717 


34867067 


A/G 


0.29 


0.30 


0.911 


rs1 32714 


86486 


34867836 


C/T 








rs132716 


86833 


34868183 


A/G 








rs132717 


87115 


34868465 


C/T 


0.48 


0.49 


0.647 


rs132718 


87234 


34868584 


A/G 


0.68 


0.69 


0.453 


rs132719 


87479 


34868829 


G/T 


0.59 


0.54 


0.078 


rs1 32720 


87561 


3486891 1 


A/G 


0.12 


0.13 


0.607 


rs1 32721 


87604 


34868954 


A/G 


0.73 


0.73 


0.919 


rs1 32722 


87674 


34869024 


C/T 


0.22 


0.22 


0.985 


rs1 32723 


87958 


34869308 


A/G 


0.03 


NA 


0.033 


rs1 32724 


87992 


34869342 


-/G 


0.12 


0.12 


0.830 


rs1 32725 


88019 


34869369 


A/G 


0.69 


0.64 


0.097 


rs1 32726 


88074 


34869424 


C/G 


0.10 


0.13 


0.240 


rs1 32727 


88079 


34869429 


C/G 


0.68 


0.67 


0.865 


rs1 32728 


88115 


34869465 


A/G 








rs1 32729 


88118 


[ 34869468 


C/G 








rs1 32730 


88120 


34869470 


A/G 








rs1 32731 


88135 


34869485 


-/CTCAT 








rs1 32732 


88142 


34869492 


G/T 








rs1 32733 


88143 


34869493 


G/T 








rs1 40575 


88149 


34869499 


ACA/TG 


0.37 


0.38 


0.656 


rs1 32734 


88340 


34869690 


A/G 


0.46 


0.44 


0.522 


rs1 32735 


88344 


34869694 


G/T 


0.85 


0.85 


0.781 


rs80587 


88512 


34869862 


C/G 


0.40 


0.42 


0.493 


rs1 32736 


88521 


34869871 


C/T 








rs1 32737 


88650 


34870000 


C/G 


0.32 


0.33 


0.824 


rs1 32738 


88827 


34870177 


C/T 


0.78 


0.80 


0.327 


rs1 807673 


89230 


34870580 


A/G 


0.28 


0.28 


0.969 


rs2014700 


89236 


34870586 


A/G 


0.89 


0.92 


0.036 


rs1 32739 


90754 


34872104 


G/A 








rs1812023 


90984 


34872334 


A/G 


0.70 


0.70 


0.997 


rs1812024 


91110 


34872460 


A/G 


0.68 


0.69 


0.755 


rs2005590 


92026 


34873376 


C/T 


0.71 


0.73 


0.252 


rs 132740 


92954 


34874304 


C/T 


0.09 


0.08 


0.370 


rs3986001 


93375 


34874725 


-/TTGC 


0.51 


0.54 


0.302 


rs24 13390 


93794 


34875144 


C/T 


0.37 


0.41 


0.103 


rs1 32743 


94937 


34876287 


C/G 


0.07 


0.07 


0.985 


rs1 32744 


95068 


34876418 


C/T 


0.73 


0.78 


0.078 


rs2413391 


96188 


34877538 


A/G 


0.39 


0.40 


0.938 


rs1 32749 


97092 


34878442 


C/T 


0.47 


0.50 


0.329 


rs 132750 


98812 


34880162 


C/T 


0.67 


0.67 


0.710 


rs1 32741 


not mapped 


not mapped 


A/C 


0.29 


0.30 


0.608 


rs2413388 


not mapped 


not mapped 


A/T 


0.31 


0.31 


0.967 


rs24 13389 


not mapped 


not mapped 


C/G 


0.30 


0.28 


0.393 
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[0230] The APOL3 proximal SNPs were also allelotyped in the replication cohorts using the 
methods described herein and the primers provided in Tables 1 1 and 12. The replication allelotyping 
results for replication cohort #1 and replication cohort #2 are provided in Tables 14 and 15, respectively. 



TABLE 14 



iSrr 


Position in 
SEQ ID NO: 
1 


v_nromosuiiK 
r uaiiiuu 


A1/A2 
/vueie 


F A2 


F A2 


F n- 
Valuc 


rs3888818 


201 


34781551 


C/T 








rs2010605 


425 


34781775 


A/G 


0.79 


0.80 


0.664 


rs743919 


1095 


34782445 


G/T 


0.11 


0.11 


0.982 


rs1008134 


2201 


34783551 


A/C 








rs1 32607 


7879 


34789229 


A/G 


0.10 


0.11 


0.589 


rs1 476029 


8395 


34789745 


C/T 


0.16 


0.13 


0.177 


rs1 476030 


8461 


3478981 1 


C/T 


0.33 


0.38 


0.149 


rs24 13380 


9503 


34790853 


C/T 


0.26 


0.30 


0.174 


rs2051609 


10304 


34791654 


G/T 








rs2413381 


10695 


34792045 


C/T 








rs1 894604 


16300 


34797650 


A/G 


0.07 


0.08 


0.476 


rs1 894605 


16444 


34797794 


G/T 


0.07 


0.09 


0.269 


rs1 32609 


17591 


34798941 


C/T 


0.68 


0.67 


0.687 


rs1 32610 


17988 


34799338 


-/A 








rs1 32611 


19116 


34800466 


-IT 


0.14 


0.14 


0.992 


rs132612 


19358 


34800708 


C/T 


0.22 


0.22 


0.799 


rs1 008790 


20300 


34801650 


A/G 


0.03 


0.06 


0.358 


rs23085 


20669 


34802019 


A/T 


0.31 


0.30 


0.868 


rs105161 


20891 


34802241 


A/G 


0.75 


0.80 


0.058 


rs132613 


21451 


34802801 


C/T 


0.82 


0.81 


0.725 


rs132614 


21978 


34803328 


C/T 


0.15 


0.15 


0.914 


rs132615 


22785 


34804135 


C/G 


0.32 


0.32 


0.885 


rs132617 


24248 


34805598 


C/T 


0.36 


0.37 


0.706 


rs3865724 


24770 


34806120 


C/T 


0.66 


0.64 


0.473 


rs20 19657 


24844 


34806194 


A/G 


0.21 


0.21 


0.898 


rs3865725 


25066 


34806416 


G/T 








rs20 19364 


25096 


34806446 


C/T 


0.41 


0.39 


0.480 


rs2008383 


25309 


34806659 


A/G 


0.17 


0.16 


0.721 


rs3986002 


25344 


34806694 


A/C 








rs3888942 


25529 


34806879 


A/T 








rs3888943 


25537 


34806887 


A/G 








rs3888944 


25554 


34806904 


A/C 








rs132618 


27963 


34809313 


A/T 


0.42 


0.45 


0.522 


rs132619 


28134 


34809484 


G/T 








rs3827346 


28356 


34809706 


A/G 


0.85 


0.84 


0.703 


rs1 32620 


29648 


34810998 


-/A 


0.29 


0.28 


0.917 


rs1 32621 


29986 


34811336 


A/G 


0.32 


0.31 


0.864 


rs80575 


30217 


34811567 


G/T 


0.27 


0.27 


0.909 


rs80576 


30267 


34811617 


A/G 


0.30 


0.22 


0.007 


rs80577 


30315 j 


34811665 


A/G 


0.24 


0.23 


0.853 


rs80578 


30585 


34811935 


C/T 


0.49 


0.49 


0.884 


rs80579 


30724 


34812074 


A/C 


0.24 


0.25 


0.761 


rs80580 


30897 


34812247 


C/T 


0.30 


0.30 


0.815 


rs1 32622 


30931 


34812281 


C/T 


0.29 


0.28 


0.884 


rs1 32623 


31080 


34812430 


G/T 


0.62 


0.63 


0.633 


rs1 32624 


31246 


34812596 


C/T 


0.36 


0.35 


0.769 


rs1 32625 


31373 


34812723 


A/T 








rs1 32626 


31463 


34812813 


A/G 


0.92 


NA 


NA 


rs1 32627 


31467 


34812817 


A/G 


0.10 


0.10 


0.844 


rs1 807672 


32188 


34813538 


G/T 


0.29 


0.29 


0.954 
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dbSNP 
rs# 


Position in 
SEQ ID NO: 
1 


Chromosome 
Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


F p- 
Value 


rs1 32628 


32288 


34813638 


C/T 


0.24 


0.25 


0.796 


rs 132629 


32520 


34813870 


A/T 


0.03 


0.06 


0.345 


rs 132630 


32594 


34813944 


A/C 


0.75 


0.72 


0.388 


rs1 32631 


32657 


34814007 


A/C 


0.70 


0.77 


0.017 


rs1 32632 


32677 


34814027 


A/G 


0.67 


0.63 


0.275 


rs1 32633 


32764 


34814114 


C/T 


0.34 


0.32 


0.558 


rs1 32634 


32784 


34814134 


A/G 


0.44 


0.41 


0.435 


rs 132635 


32830 


34814180 


C/T 


0.39 


NA 


0.409 


rs 132636 


32872 


34814222 


C/T 


0.42 


0.46 


0.179 


rs 129603 


33121 


34814471 


A/C 








rs 132637 


33348 


34814698 


G/T 


0.08 


0.10 


0.462 


rs3788518 


33952 


34815302 


C/G 


0.19 


0.19 


0.855 


rs1 32638 


34184 


34815534 


C/G 


0.54 


0.60 


0.114 


rs 132639 


34361 


34815711 


A/T 


0.13 


0.09 


0.153 


rs 132640 


35026 


34816376 


A/G 


0.31 


0.32 


0.767 


rs 132641 


35192 


34816542 


A/G 


0.46 


0.53 


0.074 


rs 132642 


35600 


34816950 


A/T 


0.16 


0.12 


0.101 


rs 132643 


36033 


34817383 


C/T 


0.42 


0.48 


0.081 


rs 132644 


36289 


34817639 


C/T 


0.50 


0.62 


-0.0001 


rs 132645 


38869 


34820219 


A/G 


0.21 


0.15 


0.027 


rs20 17329 


39629 


34820979 


A/T 


0.40 


0.38 


0.737 


rs739198 


40530 


34821880 


C/T 


0.74 


NA 


0.697 


rs1 32647 


41621 


34822971 


C/T 


0.21 


0.24 


0.430 


rs2097465 


42379 


34823729 


C/T 


0.54 


0.52 


0.670 


rs2105915 


42802 


34824152 


C/T 


0.57 


0.55 


0.582 


rs1 32648 


42865 


34824215 


T/C 








rs1 32649 


43644 


34824994 


A/G 


0.20 


0.23 


0.213 


rs 132650 


45051 


34826401 


C/T 


0.36 


0.29 


0.033 


rs 132651 


45828 


34827178 


A/C 








rs 132652 


45829 


34827179 


A/T 








rs80584 


46257 


34827607 


C/T 


0.81 


0.75 


0.043 


rs1 32653 


47286 


34828636 


A/C 


0.18 


0.13 


0.060 


rs9 16334 


47427 


34828777 


C/T 


0.32 


0.38 


0.064 


rs1 32654 


47963 


34829313 


C/T 


0.55 


0.54 


0.769 


rs 132655 


48013 


34829363 


C/T 


0.42 


0.39 


0.390 


rs1 32656 


48229 


34829579 


C/T 


0.38 


0.35 


0.429 


rs3834684 


48282 


34829632 


-/A 


0.21 


0.23 


0.416 


rs1 32657 


48376 


34829726 


-/G 


0.47 


0.53 


0.089 


rs91 6335 


48404 


34829754 


A/G 


0.39 


0.33 


0.153 


rs1 32659 


49900 


34831250 


C/T 








rs 132660 


52699 


34834049 


G/T 


0.39 


0.36 


0.227 


rs1 32661 


52897 


34834247 


A/G 


0.25 


0.31 


0.086 


rs1 32662 


53414 


34834764 


A/G 


0.63 


0.57 


0.083 


rs1 32663 


53487 


34834837 


A/T 


0.25 


0.31 


0.051 


rs1 32664 


54112 


34835462 


G/T 


0.23 


0.29 


0.037 


rs1 32667 


55492 


34836842 


A/G 


0.34 


0.32 


0.546 


rs 132670 


59766 


34841116 


C/T 








rs 132671 


60307 


34841657 


A/G 


0.49 


0.55 


0.041 


rs1 32672 


60701 


34842051 


A/G 


0.24 


0.20 


0.211 


rs1 32673 


60952 


34842302 


A/G 


0.40 


0.36 


0.284 


rs1 32674 


61401 


34842751 


C/T 


0.32 


0.29 


0.453 


rs1 32675 


62379 


34843729 


C/T 


0.38 


0.33 


0.178 


rs80585 


62870 


34844220 


C/T 


0.29 


0.24 


0.164 


rs80586 


62879 


34844229 


A/G 


0.65 


0.69 


0.212 


rs1 32676 


63499 


34844849 


A/T 


0.26 


0.20 


0.034 


rs1 32677 


64284 


34845634 


-/A 


0.69 


0.72 


0.438 


rs1 32678 


64408 


34845758 


A/G 


0.48 


0.50 


0.572 
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dbSNP 

rs# 


Position in 
SEQ ID NO: 
1 


{"'h rnmnsfimp 

X— . 1J 1 U11IU JU11IV 

Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


F p- 
Value 


rs1 32680 


64760 


34846110 


A/G 


0.20 


0.18 


0.429 


rs1 32681 


65230 


34846580 


A/G 


0.25 


0.22 


0.362 


rs1 32683 


66127 


34847477 


A/G 


0.19 


0.17 


0.449 


rs2269594 


66634 


34847984 


C/T 


0.70 


0.65 


0.234 


rs1 32684 


66686 


34848036 


A/G 


0.21 


0.18 


0.349 


rs1 32685 


66694 


34848044 


C/G 


0.30 


0.28 


0.528 


rs1 32686 


67113 


34848463 


A/G 


0.46 


0.50 


0.207 


rs1 32687 


67257 


34848607 


A/G 


0.95 


0.98 


0.217 


rs1 32688 


67403 


34848753 


A/G 


0.26 


0.23 


0.367 


rs1 32689 


67609 


34848959 


A/G 


0.60 


0.63 


0.353 


rs1 32690 


68418 


34849768 


-/A 


0.16 


0.15 


0.901 


rs1 32691 


68610 


34849960 


C/G 


0.51 


0.53 


0.573 


rs1 32692 


69629 


34850979 


C/T 


0.62 


0.64 


0.566 


rs1 32693 


70024 


34851374 


A/G 


0.58 


0.62 


0.227 


rs1 32694 


70848 


34852198 


A/G 


0.17 


0.14 


0.195 


rs1 32695 


71428 


34852778 


C/G 


0.24 


0.22 


0.560 


rs1 966266 


71553 


34852903 


C/T 


0.49 


0.46 


0.389 


rs1 966267 


71633 


34852983 


A/G 


0.40 


0.43 


0.304 


rs1 06808 


71768 


34853118 


A/C 


0.66 


0.70 


0.243 


rs 132696 


71769 


34853119 


A/G 








rs2239829 


73039 


34854389 


A/G 


0.34 


0.38 


0.264 


rs2285154 


73325 


34854675 


A/G 








rs2239830 


73412 


34854762 


A/C 


0.49 


NA 


0.490 


rs2239831 


73547 


34854897 


C/T 


0.52 


0.48 


0.254 


rs3865722 


73769 


34855119 


A/T 


0.57 


0.56 


0.739 


rs3865723 


73806 


34855156 


A/G 


0.58 


0.57 


0.819 


rs3985996 


74467 


34855817 


C/T 


0.30 


0.31 


0.799 


rs3985997 


74472 


34855822 


C/T 


0.88 


0.89 


0.941 


rs3985998 


74473 


34855823 


A/G 








rs3985999 


74482 


34855832 


C/T 


0.20 


0.20 


0.871 


rs3986000 


74494 


34855844 


A/C 








rs2413382 


74592 


34855942 


A/G 


0.61 


0.58 


0.311 


rs24 13383 


74670 


34856020 


G/T 








rs24 13384 


74672 


34856022 


G/T 








rs2413385 


74714 


34856064 


G/T 


0.70 


0.68 


0.509 


rs2413386 


74723 


34856073 


A/T 


0.71 


0.70 


0.807 


rs1 894606 


74749 


34856099 


A/G 








rs9 16336 


74861 


34856211 


C/G 








rs9 16337 


74892 


34856242 


C/T 








rs9 16338 


74893 


34856243 


C/T 


0.41 


0.39 


0.465 


rs1 32697 


75176 


34856526 


A/G 


0.58 


0.63 


0.101 


rs12781 


75705 


34857055 


A/G 








rs1 053983 


75989 


34857339 


A/G 


0.41 


0.45 


0.301 


rs1 053982 


76027 


34857377 


A/G 


0.66 


0.66 


0.959 


rs2227167 


77949 


34859299 


A/G 


0.35 


0.39 


0.313 


rs2227168 


77974 


34859324 


C/T 


0.39 


0.35 


0.234 


rs 132700 


78167 


34859517 


C/T 


0.26 


0.24 


0.652 


rs3075364 


78310 


34859660 


-/CT 


0.42 


0.45 


0.287 


rs2227169 


78415 


34859765 


C/T 


0.40 


0.36 


0.293 


rs2097466 


78575 


34859925 


C/T 


0.52 


0.49 


0.376 


rs2097467 


78590 


34859940 


C/T 








rs24 13387 


78709 


34860059 


C/T 








rs 132701 


78875 


34860225 


C/T 


0.23 


0.21 


0.499 


rs 132702 


79864 


34861214 


C/T 








rs1 32703 


81316 


34862666 


C/T 


0.52 


0.52 


0.995 


rs2269595 


81320 


34862670 


A/G 


0.20 


0.17 


0.210 


rs2269596 


81409 


34862759 


C/T 


0.42 


0.37 


0.172 
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dbSNP 

YSU 


Position in 
SEQ ID NO: 
1 


Vlll UlllUSUIIlv 

Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 

\J till \JM <u 


F D- 

Value 


rs1 32704 


81737 


34863087 


C/T 


0.64 


0.67 


0.398 


rs2007468 


81843 


34863193 


A/G 


0.39 


0.42 


0.354 


rs1 32705 


82102 


34863452 


err 


0.29 


0.28 


0.834 


rs2007706 


82833 


34864183 


C/T 


0.29 


0.27 


0.546 


rs 132706 


83461 


34864811 


C/T 








rs 132707 


83624 


34864974 


C/T 


0.26 


0.29 


0.252 


rs 132708 


83660 


34865010 


C/G 


0.29 


0.28 


0.584 


rs 132709 


83701 


34865051 


A/T 


0.46 


0.47 


0.748 


rs1 32710 


83708 


34865058 


A/G 


0.62 


0.60 


0.477 


rs1 32711 


83782 


34865132 


C/T 








rs132712 


85707 


34867057 


A/G 


0.82 


0.82 


0.984 


rs132713 


85717 


34867067 


A/G 


0.31 


0.33 


0.416 


rs132714 


86486 


34867836 


C/T 








rs132716 


86833 


34868183 


A/G 








rs132717 


87115 


34868465 


C/T 


0.48 


NA 


NA 


rs132718 


87234 


34868584 


A/G 


0.67 


0.71 


0.237 


rs132719 


87479 


34868829 


G/T 


0.56 


0.52 


0.254 


rs1 32720 


87561 


3486891 1 


A/G 


0.12 


0.14 


0.416 


rs 132721 


87604 


34868954 


A/G 


0.72 


0.71 


0.626 


rs1 32722 


87674 


34869024 


C/T 


0.23 


0.24 


0.791 


rs1 32723 


87958 


34869308 


A/G 


NA 


NA 




rs1 32724 


87992 


34869342 


-/G 


0.12 


0.12 


0.830 


rs1 32725 


88019 


34869369 


A/G 


0.60 


NA 


0.691 


rs1 32726 


88074 


34869424 


C/G 


0.09 


0.15 


0.103 


rs1 32727 


88079 


34869429 


C/G 


0.68 


0.66 


0.536 


rs1 32728 


88115 


34869465 


A/G 








rs1 32729 


88118 


34869468 


C/G 








rs1 32730 


88120 


34869470 


A/G 








rs1 32731 


88135 


34869485 


-/CTCAT 








rs1 32732 


88142 


34869492 


G/T 








rs1 32733 


88143 


34869493 


G/T 








rs1 40575 


88149 


34869499 


ACA/TG 


0.43 


0.44 


0.792 


rs1 32734 


88340 


34869690 


A/G 


0.45 


0.41 


0.263 


rs 132735 


88344 


34869694 


G/T 


0.83 


0.82 


0.629 


rs80587 


88512 


34869862 


C/G 


0.42 


0.47 


0.152 


rs1 32736 


88521 


34869871 


C/T 








rs1 32737 


88650 


34870000 


C/G 


0.33 


0.35 


0.631 


rs1 32738 


88827 


34870177 


C/T 


0.74 


0.79 


0.131 


rs1 807673 


89230 


34870580 


A/G 


0.30 


0.32 


0.583 


rs20 14700 


89236 


34870586 


A/G 


0.85 


0.90 


0.043 


rs1 32739 


90754 


34872104 


G/A 








rs1812023 


90984 


34872334 


A/G 


0.70 


NA 


0.704 


rs1812024 


91110 


34872460 


A/G 


0.66 


0.68 


0.563 


rs2005590 


92026 


34873376 


C/T 


0.72 


0.75 


0.268 


rs 132740 


92954 


34874304 


C/T 


0.10 


0.06 


0.085 


rs3986001 


93375 


34874725 


-/TTGC 


0.49 


0.53 


0.341 


rs24 13390 


93794 


34875144 


C/T 


0.35 


0.40 


0.143 


rs1 32743 


94937 


34876287 


C/G 


0.06 


0.05 


0.608 


rs1 32744 


95068 


34876418 


C/T 


0.72 


0.78 


0.069 


rs24 13391 


96188 


34877538 


A/G 


0.37 


0.39 


0.544 


rs1 32749 


97092 


34878442 


C/T 


0.48 


0.51 


0.311 


rs1 32750 


98812 


34880162 


C/T 


0.65 


0.67 


0.678 


rs1 32741 


not mapped 


not mapped 


A/C 


0.28 j 


0.30 


0.509 


rs24 13388 


not mapped 


not mapped 


A/T 


0.32 


0.32 


0.908 


rs24 13389 


not mapped 


not mapped 


C/G 


0.32 


0.29 


0.480 
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TABLE 15 



dbSNP 
rs# 


Position in 
SEQ ID NO: 
1 


Chromosome 
Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


Fp- 
Value 


rsoooooio 


zUI 


*XA7fH^K.-\ 
04/ OlDOl 


rr 
L»/ I 








rS^UlUoUO 


./IOC. 

4zO 






n Qyl 


U.o4 


rv qoa 
u.yz4 


rsf 4oyiy 


iuyo 


*}A7Q1AjI& 


rr 


n no 


u.n 


u.^y4 


rsiuuoio4 


zzUI 












rsi ozbU/ 


7Q7A 


*3>(7f5QOOQ 




U.lo 


A ^A 


rt 007 


rsi 4/ buzy 


QOQC 

ooyo 




rr 
O/ I 


U.lo 


U.lo 


rv 4 oc 


rs i 4/duou 


o4b1 


o4/oaol 1 


O/ I 


Pi y a 

U.4U 


n 

U.OO 


U.^14 


rsz41oooU 


youo 




O/ 1 


n 00 


n 00 


rv ici 
U.I DO 


rs^Uoi buy 


10304 


o47yioo4 


r> rr 
(j/l 








rsz4ioooi 


iubyo 


O yl Tfionyl E 

o4/9zU4o 


/-» /T 
0/ I 








rsi oy4bU4 


1b30U 


o yi 7n7Ccn 




0.09 


n no 


rv 71c 


rsi o94b(Jo 


4 C A A A 

1b444 


o4797794 


Gi/T 


rv Art 
0.09 


t\ r\r\ 

0.09 


rt OO yl 

0.834 


rsiozouy 


i /oyi 


o4/yay4i 




O.b/ 


n c"7 


rv ncn 
0.9O0 


rsiozbiu 


a ~7(~\C> O 

i /9oo 


O yl OO 

o4/y9ooo 


/A 

-/A 








rslozbl 1 


H C\ A A a 

191 lb 


o4o004dd 


-/I 


0.14 


0.15 


A "7yl r» 

0.749 


rs132o1z 


A AOCO 

19358 


34800708 


O/T 


0.23 


0.24 


As "yorv 

0.780 


rsiouoY9U 


z03U0 


O >|OA4 Cf~\ 


A/Ca 


untyped 


0.16 


NA 


rszoUoo 


zUbby 


34o02019 


A /T 
A/ I 


0.32 


0.35 


0.395 


rsi Obi bl 


OAOA4 

z0o91 


O OAOH A A 

o4o(Jzz41 




0.78 


0.71 


ft ftOT 


rsiozbio 


O A A C A 

zi4o1 


O A OAOOA <4 

o4o0zo01 


rr 


0.78 


0.82 


0.245 


L rslozbl 4 


zi9/o 




C/T 


0.17 


n a 0 
0.13 


0.167 


-4 OOC5 4 C 

rsi ozbl o 


0070C 
ZZrOO 


o4oU41 oo 




0.31 


n on 
0.30 


rv cru 


rslozbl r 


z4z4o 


o4ouooyo 


/-» rr- 
0/ I 


O.ob 


n oc 
0.OO 


rv oco 


rsoobo / z4 


O i(77n 

z4r AJ 


o4BUb1 zU 




0.b4 


n e"7 

O.b/ 


rv 00 yl 

0.384 


rszuiybo/ 


O /I Q A A 

z4o44 


v34oUb1 94 




0.1 7 


n a 0 
O.lo 


A "7 ^ A 

0.745 


-^O OCC70C 

rs3ob57^5 


^50bb 


3480641 6 


o/T 








rszU 1 93b4 


zouyb 


o a one /i /i c 
o4oUb44b 


U/ I 


0.38 


n on 
0.39 


rv "7rvo 
0.702 


rszUUoooo 


zoouy 


o4oUbbt>y 




n i 0 
U.lo 


n 10 
U.lo 


rv qci 


rsoyobUUz 


zoo44 


o4oUbb94 


A //^» 

A/L. 








rsoooo94z 


z55z9 


*34o0bo79 


a rr 
A/ I 








rsooooy4o 


zoooV 


o4oUboo/ 


A /O 








rSoooo944 


Z0004 


o4oUb9U4 


A/U 








rsi ozbl o 


z79b3 


o4oU9o1o 


a rr 
A/T 


A» yl y< 

0.44 


n Af\ 
0.40 


rv oorv 


rci OOtf!-l Q 

rsi ozbl y 


zol o4 


*2A QCSQA QA 

o4oljy4o4 










rsooz / o4b 


zooob 


q y* ono7nc 
o4oUy /Ub 


A/o 


rv Dyf 

O.W4 


u.ob 


U.ODO 


rsi ozozu 


zyo4o 


o4oiuyyo 


/A 

-/A 


rv *3rv 


r\ oi 
U.ol 


rv ccc 
U.DDO 


rsi ozozi 


zyyoo j 


o4o1 1 oob 


A/o 


n q 0 


n oo 
U.OZ 


rv rvee 

u.yoo 


rsouOf o 


oUzl / 


QylQi -1 CC7 

o4o1 loo/ 




rv oc 


n 07 
U.z / 


rv (no 


rsouo/ b 


oUzb f 


o4o 1 1 b 1 / 


a /r* 


0 00 

U 7z 


n on 

u.zy 


n (\*>a 
U.U^4 


rsouo// / 


oUol o 


o4oi iboo 


A/o 


rv 07 


U.zl 


rv rv70 
U.U/^ 


rsouo / o 


ouooo 


o4o i iyoo 


0/ 1 


n En 
U.OU 


U.4b 


rv oeo 


rsouo / y 


OU / Z4 


1AQ* 1CY7A 
O40 1 ZU / 4 


A/U* 


rv 00 


n oa 


rv con 

u.Djy 


rsouoou 


ouoy / 


Q/iQl OOA7 
0401 ZZ4 / 


r*/r 
L// 1 


rv 00 


U.OZ 


rv Q70 

u.y /d 


rSI OCX>£.£. 


ouyoi 


040lZZC3l 




U.oU 


U.oz 


rv Trvi 
U. / U1 


1 o 1 0^\J^O 


O 1 \JO\J 








n 

v/.OZ 




rs 132624 


31246 


34812596 


err 


0.36 


0.39 


0.393 


rs1 32625 


31373 


34812723 


A/T 








rs 132626 


31463 


34812813 


A/G 


0.86 


0.84 


0.723 


rs 132627 


31467 


34812817 


A/G 


0.14 


0.12 


0.672 


rs1 807672 


32188 


34813538 


G/T 


0.31 


0.31 


0.940 


rs 132628 


32288 


34813638 


C/T 


0.26 


0.23 


0.307 


rs1 32629 


32520 


34813870 


A/T 


0.05 


0.07 J 


0.493 


rs1 32630 


32594 


34813944 


A/C 


0.76 


0.80 


0.269 


rs1 32631 


32657 


34814007 


A/C 


0.74 


0.67 


0.058 


rs 132632 


32677 


34814027 


A/G 


0.64 


0.68 


0.316 


rs1 32633 


32764 


34814114 


C/T 


0.33 


0.34 


0.757 
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dbSNP 
rs# 


Position in 
SEQ ID NO: 
1 


Chromosome 
Position 


A1/A2 
Allele 


F A2 
Case AF 


F A2 
Control AF 


F p- 
Value 


rs1 32634 


32784 


34814134 


A/G 


0.45 


0.43 


0.574 


rs 132635 


32830 


34814180 


err 


0.42 


-0.01 




rs1 32636 


32872 


34814222 


err 


0.40 


0.40 


0.856 


rs1 29603 


33121 


34814471 


A/C 








rs1 32637 


33348 


34814698 


G/T 


0.09 


0.09 


0.882 


rs3788518 


33952 


34815302 


C/G 


0.15 


0.19 


0.133 


rs1 32638 


34184 


34815534 


C/G 


0.58 


0.55 


0.324 


rs 132639 


34361 


34815711 


A/T 


0.13 


0.16 


0.210 


rs1 32640 


35026 


34816376 


A/G 


0.34 


0.28 


0.075 


rs1 32641 


35192 


34816542 


A/G 


0.49 


0.47 


0.539 


rs1 32642 


35600 


34816950 


A/T 


0.14 


0.19 


0.079 


rs1 32643 


36033 


34817383 


C/T 


0.47 


0.45 


0.565 


rs1 32644 


36289 


34817639 


C/T 


0.57 


0.51 


0.156 


rs1 32645 


38869 


34820219 


A/G 


0.17 


0.23 


0.076 


rs20 17329 


39629 


34820979 


A/T 


0.39 


0.42 


0.531 


rs739198 


40530 


34821880 


err 


0.64 


0.05 




rs1 32647 


41621 


34822971 


err 


0.25 


0.22 


0.339 


rs2097465 


42379 


34823729 


err 


0.54 


0.50 


0.311 


rs2105915 


42802 


34824152 


C/T 


0.58 


0.56 


0.667 


rs1 32648 


42865 


34824215 


T/C 








rs1 32649 


43644 


34824994 


A/G 


0.21 


0.19 


0.471 


rs1 32650 


45051 


34826401 


C/T 


0.31 


0.34 


0.365 


rs 132651 


45828 


34827178 


A/C 








rs 132652 


45829 


34827179 


A/T 








rs80584 


46257 


34827607 


C/T 








rs 132653 


47286 


34828636 


A/C 


0.15 


0.18 


0.404 


rs916334 


47427 


34828777 


C/T 


0.36 


0.33 


0.383 


rs 132654 


47963 


34829313 


C/T 


0.53 


0.60 


0.090 


rs 132655 


48013 


34829363 


C/T 


0.39 


0.44 


0.150 


rs1 32656 


48229 


34829579 


C/T 


0.36 


0.39 


0.507 


rs3834684 


48282 


34829632 


-/A 


0.21 


0.21 


0.956 


rs1 32657 


48376 


34829726 


-/G 


0.51 


0.45 


0.112 


rs916335 


48404 


34829754 


A/G 


0.36 


0.40 


0.245 


rs132659 


49900 


34831250 


C/T 








rs 132660 


52699 


34834049 


G/T 


0.33 


0.41 


0.044 


rs1 32661 


52897 


34834247 


A/G 


0.29 


0.27 


0.624 


rs 132662 


53414 


34834764 


A/G 


0.59 


0.59 


0.998 


rs 132663 


53487 


34834837 


A/T 


0.27 


0.26 


0.792 


rs 132664 


54112 


34835462 


G/T 


0.28 


0.28 


0.934 


rs 132667 


55492 


34836842 


A/G 


0.36 


0.42 


0.054 


rs1 32670 


59766 


34841116 


C/T 








rs1 32671 


60307 


34841657 


A/G 


0.50 


0.49 


0.839 


rs 132672 


60701 


34842051 


A/G 


0.21 


0.25 


0.297 


rs 132673 


60952 


34842302 


A/G 


0.42 


0.40 


0.525 


rs 132674 


61401 


34842751 


C/T 


0.33 


0.32 


0.909 


rs 132675 


62379 


34843729 


C/T 


0.38 


0.36 


0.722 


rs80585 


62870 


34844220 


C/T 


0.27 


0.30 


0.405 


rs80586 


62879 


34844229 


A/G 


0.68 


0.62 


0.1 13 


rs1 32676 


63499 


34844849 


A/T 


0.25 


0.27 


0.442 


rs1 32677 


64284 


34845634 


-/A 


0.69 


0.64 


0.147 


rs1 32678 


64408 


34845758 


A/G 


0.43 


0.45 


0.620 


rs1 32680 


64760 


34846110 


A/G 


0.19 


0.23 


0.258 


rs1 32681 


65230 


34846580 


A/G 


0.23 


0.28 


0.201 


rs 132683 


66127 


34847477 


A/G 


0.19 


0.21 


0.493 


rs2269594 


66634 


34847984 


C/T 


0.68 


0.69 


0.866 


rs1 32684 


66686 


34848036 


A/G 


0.21 


0.24 


0.487 


rs 132685 


66694 


34848044 


C/G 


0.29 


0.29 


0.888 
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dbSNP 
rs# 


Position in 
SEQ ID NO: 
1 


Ct\ romosonip 

v_> u I uiiiuavlllv 

Position 


A1/A2 
Allele 


F A2 

Case AF 


F A2 
Control AF 


F D- 

Value 


rs1 32686 


67113 


34848463 


A/G 


0.45 


0.44 


0.708 


rs1 32687 


67257 


34848607 


A/G 


0.96 


0.93 


0.210 


rs1 32688 


67403 


34848753 


A/G 


0.22 


0.23 


0.917 


rs1 32689 


67609 


34848959 


A/G 


0.63 


0.62 


0.839 


rs1 32690 


68418 


34849768 


-/A 


0.16 


0.19 


0.295 


rs1 32691 


68610 


34849960 


C/G 


0.53 


0.50 


0.441 


rs1 32692 


69629 


34850979 


err 


0.64 


0.59 


0.223 


rs1 32693 


70024 


34851374 


A/G 


0.57 


0.52 


0.191 


rs1 32694 


70848 


34852198 


A/G 


0.17 


0.20 


0.413 


rs1 32695 


71428 


34852778 


C/G 


0.23 


0.23 


0.928 


rs1 966266 


71553 


34852903 


C/T 


0.49 


0.49 


0.881 


rs1 966267 


71633 


34852983 


A/G 


0.41 


0.38 


0.400 


rs1 06808 


71768 


34853118 


A/C 


0.71 


0.63 


0.030 


rs1 32696 


71769 


34853119 


A/G 








rs2239829 


73039 


34854389 


A/G 


0.35 


0.33 


0.665 


rs2285154 


73325 


34854675 


A/G 








rs2239830 


73412 


34854762 


A/C 


0.51 


-0.02 




rs2239831 


73547 


34854897 


C/T 


0.51 


0.53 


0.577 


rs3865722 


73769 


34855119 


A/T 


0.56 


0.57 


0.887 


rs3865723 


73806 


34855156 


A/G 


0.61 


0.60 


0.837 


rs3985996 


74467 


34855817 




0.30 


0.27 


0.501 


rs3985997 


74472 


34855822 


C/T 


0.89 


0.92 


0.287 


rs3985998 


74473 


34855823 


A/G 








rs3985999 


74482 


34855832 


C/T 


0.18 


0.17 


0.691 


rs3986000 


74494 


34855844 


A/C 








rs24 13382 


74592 


34855942 


A/G 


0.60 


0.62 


0.642 


rs2413383 


74670 


34856020 


G/T 








rs24 13384 


74672 


34856022 


G/T 








rs24 13385 


74714 


34856064 


G/T 


0.70 


0.72 


0.433 


rs24 13386 


74723 


34856073 


A/T 


0.70 


0.72 


0.510 


rs 1894606 


74749 


34856099 


A/G 








rs9 16336 


74861 


34856211 


C/G 








rs9 16337 


74892 


34856242 


C/T 








rs9 16338 


74893 


34856243 


C/T 


0.39 


0.43 


0.334 


rs1 32697 


75176 


34856526 


A/G 


0.60 


0.57 


0.463 


rs12781 


75705 


34857055 


A/G 








rs 1053983 


75989 


34857339 


A/G 


0.43 


0.39 


0.279 


rs1 053982 


76027 


34857377 


A/G 


0.74 


0.74 


0.852 


rs2227167 


77949 


34859299 


A/G 


0.35 


0.35 


0.998 


rs2227168 


77974 


34859324 


C/T 


0.38 


0.42 


0.226 


rs1 32700 


78167 


34859517 


C/T 


0.25 


0.25 


0.989 


rs3075364 


78310 


34859660 


-/CT 


0.41 


0.37 


0.252 


rs2227169 


78415 


34859765 


C/T 


0.36 


0.38 


0.520 


rs2097466 


78575 


34859925 


C/T 


0.52 


0.57 


0.255 


rs2097467 


78590 


34859940 


C/T 








rs2413387 


78709 


34860059 


C/T 








rs1 32701 


78875 


34860225 


C/T 


0.23 


0.27 


0.255 


rs1 32702 


79864 


34861214 


C/T 








rs1 32703 


81316 


34862666 


C/T 








rs2269595 


81320 


34862670 


A/G 


0.18 


0.19 


0.672 


rs2269596 


81409 


34862759 


C/T 


0.39 


0.43 


0.258 


rs1 32704 


81737 


34863087 


C/T 


0.67 


0.64 


0.391 


rs2007468 


81843 


34863193 


A/G 


0.41 


0.37 


0.247 


rs1 32705 


82102 


34863452 


C/T 


0.32 


0.33 


0.785 


rs2007706 


82833 


34864183 


C/T 


0.36 


0.41 


0.226 


rs1 32706 


83461 


3486481 1 


C/T 








rs1 32707 


83624 


34864974 


C/T 


0.27 


0.26 


0.598 
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dbSNP 


Position in 


Chromosome 


A1/A2 


F A2 


F A2 


Fp- 


rs# 


SEQ ID NO: 
1 


Position 


Allele 


Case AF 


Control AF 


Value 


rs1 32708 


83660 


34865010 


C/G 


0.31 


0.30 


0.754 


rs1 32709 


83701 


34865051 


ATT 


0.43 


0.45 


0.627 


rs1 32710 


83708 


34865058 


A/G 


0.63 


0.60 


0.482 


rs1 32711 


83782 


34865132 


C/T 








rs132712 


85707 


34867057 


A/G 


0.87 


0.86 


0.667 


rs132713 


85717 


34867067 


A/G 


0.28 


0.24 


0.293 


rs132714 


86486 


34867836 


C/T 








rs132716 


86833 


348681 83 


A/G 








rs132717 


87115 


34868465 


C/T 


0.47 


0.49 


0.533 


rs132718 


87234 


34868584 


A/G 


0.68 


0.66 


0.705 


rs132719 


87479 


34868829 


G/T 


0.62 


0.57 


0.223 


rs 132720 


87561 


3486891 1 


A/G 


0.12 


0.11 


0.727 


rs1 32721 


87604 


34868954 


A/G 


0.75 


0.77 


0.495 


rs 132722 


87674 


34869024 


C/T 


0.21 


0.20 


0.630 


rs1 32723 


87958 


34869308 


A/G 








rs 132724 


87992 


34869342 


-/G 








rs1 32725 


88019 


34869369 


A/G 


0.69 


0.00 




rs 132726 


88074 


34869424 


C/G 


0.12 


0.10 


0.509 


rs1 32727 


88079 


34869429 


C/G 


0.68 


0.70 


0.583 


rs1 32728 


88115 


34869465 


A/G 








rs 132729 


88118 


34869468 


C/G 








rs 132730 


88120 


34869470 


A/G 








rs 132731 


88135 


34869485 


-/CTCAT 








rs1 32732 


88142 


34869492 


G/T 








rs 132733 


88143 


34869493 


G/T 








rs1 40575 


88149 


34869499 


ACA/TG 


0.30 


0.29 


0.903 


rs1 32734 


88340 


34869690 


A/G 


0.47 


0.49 


0.571 


rs1 32735 


88344 


34869694 


G/T 


0.88 


0.90 


0.595 


rs80587 


88512 


34869862 


C/G 


0.37 


0.33 


0.365 


rs 132736 


88521 


34869871 


C/T 








rs1 32737 


88650 


34870000 


C/G 


0.31 


0.29 


0.686 


rs1 32738 


88827 


34870177 


C/T 


0.82 


0.81 


0.773 


rs1 807673 


89230 


34870580 


A/G 


0.26 


0.23 


0.356 


rs20 14700 


89236 


34870586 


A/G 


0.93 


0.96 


0.324 


rs 132739 


90754 


34872104 


G/A 








rs1812023 


90984 


34872334 


A/G 


0.71 


-0.01 




rs1812024 


91110 


34872460 


A/G 


0.70 


0.70 


0.888 


rs2005590 


92026 


34873376 


C/T 


0.69 


0.70 


0.719 


rs1 32740 


92954 


34874304 


C/T 


0.08 


0.09 


0.442 


rs3986001 


93375 


34874725 


-/TTGC 


0.53 


0.55 


0.563 


rs2413390 


93794 


34875144 


C/T 


0.40 


0.43 


0.365 


rs 132743 


94937 


34876287 


C/G 


0.07 


0.09 


0.436 


rs1 32744 


95068 


34876418 


C/T 


0.74 


untyped 




rs2413391 


96188 


34877538 


A/G 


0.43 


0.41 


0.611 


rs1 32749 


97092 


34878442 


C/T 


0.47 


0.48 


0.858 


rs1 32750 


98812 


34880162 


C/T 


0.70 


0.67 


0.285 


rs1 32741 


not mapped 


not mapped 


A/C 


0.30 


0.30 


0.999 


rs24 13388 


not mapped 


not mapped 


A/T 


0.30 


0.30 


0.989 


rs2413389 


not mapped 


not mapped 


C/G 


0.28 


0.26 


0.555 



[0231] Allelotyping results were considered particularly significant with a calculated p-value of less 
than or equal to 0.05 for allelotype results. These values are indicated in bold. The allelotyping p-values 
were plotted in Figure 1 for the discovery cohort. The position of each SNP on the chromosome is 
presented on the x-axis. The y-axis provides the negative logarithm (base 1 0) of the p-value comparing 
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the estimated allele in the case group to that of the control group. The minor allele frequency of the 
control group for each SNP designated by an X or other symbol on the graph in Figure 1 can be 
determined by consulting Table 13. For example, the left-most X on the left graph is at position 
3478 1551. By proceeding down the Table from top to bottom and across the graphs from left to right the 
allele frequency associated with each symbol shown can be determined. 

[0232] To aid the interpretation, multiple lines have been added to the graph. The broken horizontal 
lines are drawn at two common significance levels, 0.05 and 0.01 . The vertical broken lines are drawn 
every 20kb to assist in the interpretation of distances between SNPs. Two other lines are drawn to 
expose linear trends in the association of SNPs to the disease. The light gray line (or generally bottom- 
most curve) is a nonlinear smoother through the data points on the graph using a local polynomial 
regression method (W.S. Cleveland, E. Grosse and W.M. Shyu (1992) Local regression models. Chapter 
8 of Statistical Models in S eds J.M. Chambers and T.J. Hastie, Wadsworth & Brooks/Cole.). The black 
line provides a local test for excess statistical significance to identify regions of association. This was 
created by use of a lOkb sliding window with lkb step sizes. Within each window, a chi-square 
goodness of fit test was applied to compare the proportion of SNPs that were significant at a test wise 
level of 0.01, to the proportion that would be expected by chance alone (0.05 for the methods used here). 
Resulting p-values that were less than 10" 8 were truncated at that value. 

[0233] The exons and introns of the genes in the covered region are plotted below each graph at the 
appropriate chromosomal positions. The gene boundary is indicated by the broken horizontal line. The 
exon positions are shown as thick, unbroken bars. An arrow is place at the 3' end of each gene to show 
the direction of transcription. 

Example 5 

In Vitro Production of Target Polypeptides 

[0234] cDNA is cloned into a pIVEX 2.3-MCS vector (Roche Biochem) using a directional cloning 
method. A cDNA insert is prepared using PCR with forward and reverse primers having 5' restriction 
site tags (in frame) and 5-6 additional nucleotides in addition to 3' gene-specific portions, the latter of 
which is typically about twenty to about twenty-five base pairs in length. A Sal I restriction site is 
introduced by the forward primer and a Sma I restriction site is introduced by the reverse primer. The 
ends of PCR products are cut with the corresponding restriction enzymes {i.e., Sal I and Sma I) and the 
products are gel-purified. The pIVEX 2.3-MCS vector is linearized using the same restriction enzymes, 
and the fragment with the correct sized fragment is isolated by gel-purification. Purified PCR product is 
ligated into the linearized pTVEX 2.3-MCS vector andii. coli cells transformed for plasmid 
amplification. The newly constructed expression vector is verified by restriction mapping and used for 
protein production. 
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[0235] E. coli lysate is reconstituted with 0.25 ml of Reconstitution Buffer, the Reaction Mix is 
reconstituted with 0.8 ml of Reconstitution Buffer; the Feeding Mix is reconstituted with 10.5 ml of 
Reconstitution Buffer; and the Energy Mix is reconstituted with 0.6 ml of Reconstitution Buffer. 0.5 ml 
of the Energy Mix was added to the Feeding Mix to obtain the Feeding Solution. 0.75 ml of Reaction 
Mix, 50 ul of Energy Mix, and 10 ug of the template DNA is added to the E. coli lysate. 

[0236] Using the reaction device (Roche Biochem), 1 ml of the Reaction Solution is loaded into the 
reaction compartment. The reaction device is turned upside-down and 1 0 ml of the Feeding Solution is 
loaded into the feeding compartment. All lids are closed and the reaction device is loaded into the 
RTS500 instrument. The instrument is run at 30°C for 24 hours with a stir bar speed of 1 50 rpm. The 
prVEX 2.3 MCS vector includes a nucleotide sequence that encodes six consecutive histidine amino 
acids on the C-terminal end of the target polypeptide for the purpose of protein purification. Target 
polypeptide is purified by contacting the contents of reaction device with resin modified with Ni + ions. 
Target polypeptide is eluted from the resin with a solution containing free Ni 2+ ions. 

Example 6 

Cellular Production of Target Polypeptides 

[0237] Nucleic acids are cloned into DNA plasmids having phage recombination cites and target 
polypeptides are expressed therefrom in a variety of host cells. Alpha phage genomic DNA contains 
short sequences known as attP sites, and E. coli genomic DNA contains unique, short sequences known 
as attB sites. These regions share homology, allowing for integration of phage DNA into E. coli via 
directional, site-specific recombination using the phage protein Int and the E. coli protein IHF. 
Integration produces two new att sites, L and R, which flank the inserted prophage DNA. Phage excision 
from E. coli genomic DNA can also be accomplished using these two proteins with the addition of a 
second phage protein, Xis. DNA vectors have been produced where the integration/excision process is 
modified to allow for the directional integration or excision of a target DNA fragment into a backbone 

vector in a rapid in vitro reaction (Gateway™ Technology (Invitrogen, Inc.)). 

[0238] A first step is to transfer the nucleic acid insert into a shuttle vector that contains attL sites 
surrounding the negative selection gene, ccdB (e.g. pENTER vector, Invitrogen, Inc.). This transfer 
process is accomplished by digesting the nucleic acid from a DNA vector used for sequencing, and to 
ligate it into the multicloning site of the shuttle vector, which will place it between the two attL sites 
while removing the negative selection gene ccdB. A second method is to amplify the nucleic acid by the 
polymerase chain reaction (PCR) with primers containing attB sites. The amplified fragment then is 
integrated into the shuttle vector using Int and IHF. A third method is to utilize a topoisomerase- 
mediated process, in which the nucleic acid is amplified via PCR using gene-specific primers with the 5 ' 
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upstream primer containing an additional CACC sequence (e.g., TOPO® expression kit (Invitrogen, 
Inc.)). In conjunction with Topoisomerase I, the PCR amplified fragment can be cloned into the shuttle 
vector via the attL sites in the correct orientation. 

[0239] Once the nucleic acid is transferred into the shuttle vector, it can be cloned into an expression 
vector having attR sites. Several vectors containing attR sites for expression of target polypeptide as a 
native polypeptide, N-fusion polypeptide, and C-fusion polypeptides are commercially available (e.g. , 
pDEST (Invitrogen, Inc.)), and any vector can be converted into an expression vector for receiving a 
nucleic acid from the shuttle vector by introducing an insert having an attR site flanked by an antibiotic 
resistant gene for selection using the standard methods described above. Transfer of the nucleic acid 
from the shuttle vector is accomplished by directional recombination using Int, IHF, and Xis (LR 
clonase). Then the desired sequence can be transferred to an expression vector by carrying out a 
one hour incubation at room temperature with Int, IHF, and Xis, a ten minute incubation at 37°C with 
proteinase K, transforming bacteria and allowing expression for one hour, and then plating on selective 
media. Generally, 90% cloning efficiency is achieved by this method. Examples of expression vectors 
are pDEST 14 bacterial expression vector with att7 promoter, pDEST 15 bacterial expression vector with 
a T7 promoter and a N-terminal GST tag, pDEST 17 bacterial vector with a T7 promoter and a N- 
terminal polyhistidine affinity tag, and pDEST 12.2 mammalian expression vector with a CMV promoter 
and neo resistance gene. These expression vectors or others like them are transformed or transfected into 
cells for expression of the target polypeptide or polypeptide variants. These expression vectors are often 
transfected, for example, into murine-transformed a adipocyte cell line 3T3-L1, (ATCC), human 
embryonic kidney cell line 293, and rat cardiomyocyte cell line H9C2. 

Nucleotide and Amino Acid Sequence Embodiments 

[0240] Table A includes information pertaining to the incident polymorphic variant associated with 
osteoarthritis identified herein. Public information pertaining to the polymorphism and the genomic 
sequence that includes the polymorphism are indicated. The genomic sequences identified in Table A 
may be accessed at the http address www.ncbi.nih.gov/entrez/query.fcgi, for example, by using the 
publicly available SNP reference number (e.g. , rs 1 32659). The chromosome position refers to the 
position of the SNP within NCBI's Genome Build 34, which may be accessed at the following http 
address: www.ncbi.nlm.nih.gov/mapview/map_search.cgi?chr=hum_chr.inf&query=. The "Contig 
Position" provided in Table A corresponds to a nucleotide position set forth in the contig sequence (see 
"Contig Accession No."), and designates the polymorphic site corresponding to the SNP reference 
number. The sequence containing the polymorphisms also may be referenced by the "Nucleotide 
Accession No." set forth in Table A. The "Sequence Identification" corresponds to cDNA sequence that 
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encodes associated target polypeptides (e.g., APOL3) of the invention. The position of the SNP within 
the cDNA sequence is provided in the "Sequence Position" column of Table A. If the SNP falls within 
an exon, the corresponding amino acid position (and amino acid change, if applicable) is provided as 
well. Also, the allelic variation at the polymorphic site and the allelic variant identified as associated 
with osteoarthritis is specified in Table A. All nucleotide and polypeptide sequences referenced and 
accessed by the parameters set forth in Table A are incorporated herein by reference. 



Table A 



RSJD 


Chrom- 
osome 


Chrom 
Position 


Contig 
Accession 
No. [1] 


Contig 
Posi- 
tion 


Nucleotide 
Acces- 
sion No. 
f21 


Sequence 
Position 


Amino 
Acid 
Posi- 
tion 


Locus 


Locus 
ID 


A [3] 


Allelic 
Variability 


OA 
Assoc. 
Allele 


rs 132659 


22 


34831250 


Hs22 11677 34: 
9 


15897265 


NM 014349 


mrna-utr 




AP0L3 


80833 


F 


cm 


C 



[1] Contig Accession Number which can be found in the NCBI Database: 
http address: www.ncbi.nih.gov/entrez/query.fcgi 

[2] Sequence Identification or Nucleotide Accession Number which can be found in the NCBI 
Database: 

http address: www.ncbi.nih.gov/entrez/query.fcgi 

T31 "A" column is the sequence orientation ("F" is forward. "R" is reverse). 

[0241] The following is a genomic nucleotide sequence for an APOL3 region. The following 
nucleotide representations are used throughout: "A" or "a" is adenosine, adenine, or adenylic acid; "C" or 
"c" is cytidine, cytosine, or cytidylic acid; "G" or "g" is guanosine, guanine, or guanylic acid; "T" or "t" is 
thymidine, thymine, or thymidylic acid; and "I" or "i" is inosine, hypoxanthine, or inosinic acid. Exons 
are indicated in italicized lower case type, introns are depicted in normal text lower case type, and 
polymorphic sites are depicted in bold upper case type. SNPs are designated by the following 
convention: "R" represents A or G, "M" represents A or C; "W" represents A or T; "Y" represents C or 
T; "S" represents C or G; "K" represents G or T; "V" represents A, C or G; "H" represents A, C, or T; 
"D" represents A, G, or T; "B" represents C, G, or T; and "N" represents A, G, C, or T. 

APOL3 Genomic Sequence (SEP ID NO: 1) 

>22: 34781351-34880400 

1 tacacttttt tttttttttt tgagacagag tcttgctcta ttgcccaggc tggagtgcag 

61 tggcactatc tcagctgact gcaacgcctc ctgggttcaa gctattttca tacctcagcc 

121 tcccaagtag ctgggattac aggtgtgtgc caccatgccc agctaatttt cgttttttaa 

181 gtagagacag ggttccacca Ygttggctag gctggtctca aactcctgac ctcaagtgat 

241 cctcctgcct tggcctccga aagtgctggg attacaggcg tgagccacca aacccggcct 

301 gatacacaat tgtctatggg agaagcacag tgaatgcact ctgatttagc aatcttattc 

361 tcttgaatgg tttgtgcttt ttatgtctta ggaaaccttc cttggtccca agatattcta 

421 tagaRacttt attttgtact tttcttctaa aagtcttttc tttttttttt ttttttgaga 
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481 cagagtctca ctccgttgcc caggctggag tgcagtggca cgatcttggc tcactgcaac 

541 ctccgcctcc tcggttcaag cgattctcct gcctcagcct ccagagtagc tgggactaca 

601 ggcatgcgcc accacgcctg gctaattatt gtatttttag tagagacggg gtttcaccat 

661 attggccagg ctagtcttga actcctgagc tcaagtgatc tgcccatctc ggcctcccaa 

721 agtgcaggga ttacaggcgt gagccaccac gcccagcctc taaaagtctt aacgttttgc 

781 ctttcacatt tacatctgga attgattttg ttatgcggtg aggaaagaat ctttttattt 

841 ttctctttgg ggactcaccc cccactccat ttgcaggcct gttgggaggt aggctcatct 

901 tagaggaaca gaccctgggc aggctgtgac acagggctcc agcaacaagc cgcagaagcg 

961 gaactcataa catgtggcat tggcattccc taccattact gattcaatgc cattccacgg 

1021 tggccctggg tcatggacca gctcatgttg ggctaatgtc ccaacacaca caaagatggt 

1081 aggtttgact aagcMagagg cagtccttag gtgaggttgc agggagaggg aagagggctt 

1141 ggtgaagaat agaagaccca tgaagaaggg gtgtggtctt ggtgaccacg tgatcccata 

1201 gtattcccat agacactcaa gacagctggt aaattgagct gctccccaga ggaggccaac 

1261 agacaggctg gacaatgact ggggtctgag cagagacccc gtctcccttc tgctgcacct 

1321 gctgcagctt aacattgact agactaccag gctcagcctc agcaaggccc ccacctggct 

1381 ctgggcctgg tgaccaggga aggtgggaag agagggggct catcacttcc tacagaagga 

1441 agctcagact tccaatttcg tgtggcataa gagcatgggc tctgggcaac actgcctgga 

1501 ttagtattca gctcgggccc tactcgtggt ataactggga gtcactaatt ttatctctgg 

1561 gcttcactat aaatgggatc ctaatggcag tctacccaag cagggtattg tgaggcctag 

1621 agatgatgag gcaaacagtt ggttctcaat acaagaaaga aatttatcat ttgcatgtag 

1681 ctgcatagct gataagccca tgacagtttt cccataacct gttttcatgg ccttattttc 

1741 cgcaactctg tctatttcag ctacgtggaa ccactcctgt gtactgtgac agtctaatgt 

1801 tgggatctgg ttaaatatat tatggcccat ccaggcgcgg tggctcacgc ctgtaacccc 

1861 agcactttga gaggccaaga caggcgaatc accctgaggt caggagttcg agaccagctt 

1921 ggccaagatg gtgaaacccc gtctctacta aaaatacaaa aattagctga gtgtggtggt 

1981 gcatgcctgt aatcccagct actcgggagg ctgaggcagg agaattgctt tgaaaccggg 

2041 aggtggaggt tgcattgagc caagattgca ccactgcact ccaacttggg cgacagacag 

2101 agagagactc ggtctcaaaa aaaaaaaaaa aaaaagttat ggcccatcca caggatgcaa 

2161 tatcatatag ccatttaaaa gaatgggcta catctgaacc Mgatggaaga aaagatgccc 

2221 ggatatgtta ttaatgggat gaggcaacgt gtagaacctg tgtaaaatag gatttttttt 

2281 ctgagatgga gtcttgctct gtcacccagg ctggagtgca ttggtgccat ctctcctcac 

2341 tgcaacctct gcctcctggg ttcaagcgat tctcctgcct cagcctccca agtagctggg 

2401 attataggag cccgccacca catctggcta gtttttgtat ttttagtaga gacgaggttt 

24 61 caccatgttg gccaggctgg tcttgaactc ctgatctcag gtgatccacc cacctcggcc 

2521 tcccaaagtg ctgggattac aggtgtgagc caccacgcct ggccaagtag gatgctttta 

2581 tgccatattt aaaagaatcc acagggagct gttaatgctg atacttttgg ggaaagggct 

2641 cagtggagca gctaagcttt tgcttttctt tttgttcctt tctgtggaat tggcattgcc 

2701 taccaacact gattcagtgt cattccacgg tggccctggg gcatggtgtt ttttttcatg 

2761 gatcagctct gcctgacacc accatgcagc aggctgtact gtgagcctca ttttgcaggt 

2821 gagagaagca aggcacagag aggttacgta acttgcttaa agacatgtga caagtaagag 

2881 gtggagctgg gattcaaact caggcagtgg aggtctggag tccctatttt tagctactac 

2941 atgacactgt ctcccaatat attattttta tctatttaaa aattcaaagc agttttctgt 

3001 gttatagact attttttact tcttcacatg cttaaaaaca aacaaacaaa aaaaaataaa 

3061 acagaggcca ggcacggtgg ctcacgcctg taatcccagc actttggaag gccgaggaag 

3121 gtggattacc tgaggtcagg agttcgagac cagcctggcc aacatggtga aaccctgtct 

3181 ttactaaaaa tacaaaaaat agccaagtgt ggtggcatgc gcctgtagtc ccagctactc 

3241 gggaggctga agcaggagaa tcgcttgaac ctgggagaca gagtttgcag tgagctgaga 

3301 ttgtgtcatt gtactccagc ctgggcgaca gagagacttc atctcaaaaa aacaaaacaa 

3361 aacaaaacaa aacaaaaaaa aacaggaaca atgcatagcc catgatgaat tgttttatta 

3421 tctgtgactc ttattgttca ttggttgctt ctgtccctct ctgaccacag cctgatggga 

3481 ggcacaggcg agggaagaac gtggctttgg gagcaggggg accctcattc tcatcccagc 

3541 tctgcacgaa ctgcggcatg accttggcca ggccccttcc cttccttggc ctcagcctcc 

3601 atctgtagag ctgggccaat gccggagcca atgacctctg cgggccctcc tgcacaagga 

3661 ttctctccct ctttctgatc tatttttgcc tgacgcctct cccaggggcc tgtgggcctg 

3721 agctaatttt cctggctggc cccagtccag gcagctcagg tggagagcgc caagctaatg 

3781 aggccaaggt catggtccag ttagcatctc tcggaggaac cgtggctttt cagccagact 

3841 gtagctccct gaccagtcac tgcgctcagg tagaggcggg ccctagagga gaggacaata 

3901 gcaaagctgg atcctgcaag tcagcatttc cagtaggagc tgctcagggc ccagggccca 

3961 gatacccttt gtgactgcag ggcagcatgt ccaggctcac atcaacatac caggatgatt 

4021 caaggagtgt gtacgttggg ggttagggag aggaccaggt ggatcaccaa gtgcctgcaa 

4081 gtacccccac acctgtcaag tcaggaagat gagaaccaat tctaccttta aatctaaagc 

4141 agattgggag gccgaggcag gaggatcacc tgaggtcagg agtttgagac cagcgtgacc 

4201 aacatggtga aaccccatct ctactaaaaa tataaaaatt agccaggcat ggtggcgggc 

4261 gccacctccc agctactcag gaggctgtgg cgtgagaatc gcttgaacct gggaggcgaa 
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4321 ggttgcattg agccaagatc atgccattgc actccagcct gggggataga gcgagactct 

4381 gtctcaaaaa acaaaacaaa acaaaacaaa caatctatat ctatatctat atctatctat 

4441 ctatctatct atctatctat ctatctatct atctatataa agcagacagc atgtccacat 

4501 tctctcaagg tgaccccctc tttctaacat cagaactttc cttttttcta catgaggcat 

4561 ttccaccact tttaaatcag atgtaagaca gagtaagcct tcctctttta cctgccacga 

4621 gggccagggg ttgtctgttc cttctctact acatccccgg cacctagacc catgcctggc 

4 681 gtacctacac taggagctca gtcaatattg ttgagcgaat caaatcacca ctgttccaga 

4741 atgaggctgc agactgcctt tgaacggaca cagtattgcc agtgggtttg actcagtttt 

4801 gttcaaatat ttcaatttct ttttttggtt tgtttttctg agggagtctc actctgttgc 

4861 ccaggctggc gtgcagtagc gtgatctcgg tcggctcact gcaacctcca cctcctgggt 

4921 tcaggcgatt ctcctgcctc agcctcccaa gtagctgggg ctacaagcat gcgccaccac 

4981 acctggctaa tatttttgta tttttagtag atacagggtt tcaccatgtt ggtcaggctg 

5041 gtctcaaact actgacctca aatgatccac ctgcctcagt ctccgaaagt gctgggatca 

5101 caggcgtgag tcaccgtgcc tggcctcaga tgcttctatt tcaacgagtc tggtgaagct 

5161 tgttatttag agagtaatat gtgaatgctc tagaaaaagc aattctcctg tctcagcctc 

5221 ccaagtagct ggcactacag gtgcccacca ccatgcctgg ttaagttttt gatttttact 

5281 agagacaagg ttttaccatg ttggccaggc tggtctcgaa ctcctgactt caagtgatct 

5341 accctccttg gcctcccaaa gtgctgggat tagaggaatg agccaccaca cttggccaaa 

5401 ttattccttt tacaatctgc tacaaccccc tgatttgaac attttgcaaa tttgaatgtt 

54 61 ctcattaggt tttcgattca gttcagtgaa tgtttattat taagtattgt tattatatta 

5521 ttattcatta ttgttattat ttaataagaa taattgagtt cttaccataa gccagacttt 

5581 gggttaaaca cataacatgt agatttagtt aattttcatg acaaatctat aagatgggca 

5641 ctactgctat cctactttac agatgagaaa actgaggcag acagaggtta agtaaattgc 

5701 tcaaccccac atgaatagta agttctggag ccaggagtca aacttcagag cctacactcc 

5761 taatcacagc taatgttgct cttcaggacg tactttacat gcaagtacag taactgtcta 

5821 caaggcactg gtgattaaaa aatgaacaaa gcctttaaca ggtgaatggt taagcagact 

5881 gtggcacatc cacatcatgg aacacaattc agcaacagca cggaatgaac actattgatg 

5941 cacacagtaa cctgaatgac tctccagaga attatgccaa ctgcgaaaag gcaatccgca 

6001 agagttacgt actctatgat gccatttact tatttatttt gagacctagt ctcgctctgt 

6061 cacccaagct ggagtgcagt ggcatgatct cggctcactg taacctccgc ctcctgggtg 

6121 caagcgattt tcctgcctca gctttccaag tagctgggac tacaggcgtg caccaccacg 

6181 cccagctaat ttttgtattt ttagtagaga cagggtttta ccatgttggc cgggctggtc 

6241 tcgaacttct gacctcaaat gatccgccct ccttggcctc ccaaagtgct gggattacag 

6301 gcgtgagcca ccgtgcctgg aaaattattt atttttaaga tttgttaagt gtagtcatga 

6361 gaagcgggaa agagtagaac aaggagtttt atctgctgtg actgaacaat caattgagat 

6421 aacgcggcac cttcggatca gctgattcca tttatctaac attcttaaat ggacccactt 

6481 atagacatga agaatagatt agcaattgcc aaggctaaag tagggatcag ggtgggagag 

6541 aagtgggtga ggctataaaa ggacaacaag agggattctt gtggtgatag aaatattcta 

6601 tatcttgatt atatcaatgc caatatcctg gttgtgatat tgaagaggct aaagctgcat 

6661 ttggtacaat aaaatacaat tagcagccta taaccccaaa ggagaaaact ggccagttgt 

6721 gtattaaatc tttgtgtgtt tgcatatgtg acatttaaat gcttaattta tatgccaaca 

6781 tttagcattt ttccaagatg ttactattaa cggaaactga gtaaagggta catgggatct 

6841 ctctggatta ctttgtacaa atgcaaataa aaatgttaat ttaaaaaaac agaatgaggc 

6901 caggtacggt ggttcatgcc tgtaatccca gagctttggg agaccaaggt gggaggatcg 

6961 attgaggcca ggagttcaag accagcctgg ccaaccagca agaccccatc tctacaaaaa 

7021 attaaaaaat tcagctgggc atggtggcgt atgcctgtag tcctagctac tgcagaggct 

7081 gaggcaggaa aatcacttga gactgagagg tcgaggctgc agtgagcaat gatggtgcca 

7141 ctatactcta gcctgggcaa cagggcaaga gaatgaggca gacaggagcc cagcctggca 

7201 ggaaagggag gttgtattag tcagggttct ccagagagac agagctaata ggatagatgt 

7261 atatatgcag gggagtttat taaggagtat tgactcacac gaacacaagg tgaagtccca 

7321 caacaggctg ttttcaagct gattagcaaa gaagccagcc agagtctcaa aacctcaaaa 

7381 gtagggaagc tgacagcgca gcttttagtc tgtggccgga ggcctgagag cccctggcaa 

7441 accactggtg taagtccaag agtccaaaag ctgaagaact tggagtctga tgttcaaggg 

7501 caggaagcat ccagcacggg agaaagacca tggaagggag tgaaggctgg aagactcagc 

7561 aagtctagtc cttccaagtt cttctgcctg cttaattcta gccgtgttgg cagctgatta 

7621 gatggtgccc acccagattg agggtgcgtc tgcctctccc aggccactga ttcaaatgtt 

7681 aatctccttt ggccacaccc tcacagacac acccaggaac aatactttgc atccttcaat 

7741 ccaatcaagt tgacactcaa tattaactat cacagagggc ttccaatgca gtgacagttg 

7801 acatgagtcc tcaaggttag tttgcaggtc acaagggaga gaaggacagg aggcctgagg 

7861 caggcaggac agcatgtgRt agtgggctgt ttgggtcttc cttccccaaa gtcatcctgg 

7921 gatggtgaga tctccggcca ttcattccat tgcacagagg cattcatgca ggctacctcc 

7981 caacttgttc ataaaaatgt taagtgatgg acaaaataat gtctttacat ttttcttaaa 

8041 cgtttggttc ctgctgctgc ttatctggta gagaggctca tcagaggaga aaatgaagga 

8101 agattccctt gtcaaacatc acaacagaga gcaagcatct ttgttcagcc tgtcaggatc 
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8161 

8221 

8281 

8341 

8401 

8461 

8521 

8581 

8641 

8701 

8761 

8821 

8881 

8941 

9001 

9061 

9121 

9181 

9241 

9301 

9361 

9421 

9481 

9541 

9601 

9661 

9721 

9781 

9841 

9901 

9961 

10021 

10081 

10141 

10201 

10261 

10321 

10381 

10441 

10501 

10561 

10621 

10681 

10741 

10801 

10861 

10921 

10981 

11041 

11101 

11161 

11221 

11281 

11341 

11401 

11461 

11521 

11581 

11641 

11701 

11761 

11821 

11881 

11941 



atctttcaca 
tttcttttta 
tgataacagc 
atctcaattc 
ggctatggtc 
Ygaataggcc 
ccaaaaaacc 
actcattgat 
gtcacccagg 
ttcaaatgat 
cacctggcta 
tcttgaactc 
agttgtgagc 
catctaagga 
cagcagcttt 
cctaggctgg 
cggatcacct 
tattaaaaat 
gaggctgagg 
gcgccattgc 
aactaaacaa 
cccgatctca 
tggggcaggt 
ccactttctg 
attgctgtgg 
ggttacatat 
gaaaggggaa 
ttgaaaacca 
agaagccaga 
ggtcaatgtg 
cccagacaga 
tggcagtcac 
gtttgttttt 
ctgggctgat 
tcctgcttca 
ttctgtattt 
gcgatccgcc 
ggccagacgc 
tttccctggc 
gtgctctcaa 
ggctttgtgc 
caaggtcccg 
cctcctggct 
actgcagccc 
tggtctggtt 
tcttctccgg 
ccctcatccc 
tagcggaggc 
gggagcatca 
ctctactaaa 
tggggaggtt 
gattgagcca 
aaagtcttag 
tgagatctct 
cactcatggg 
tcggtgtctc 
ggcctattaa 
taagctgatt 
cctttcttgt 
aggctggaga 
agtgccccaa 
gaaattcagt 
tgctggggga 
gtataataaa 



ttcaatttag 
aaatgtttgt 
ctaggagtgg 
cacttagagg 
atgaccttga 
agtattgcca 
tagtatacct 
ttattttttt 
ctagagtgca 
tctcctgcct 
atttttatat 
ctgacctcag 
caccacgcct 
tttaaagctg 
gcttatatca 
gcacggtggc 
gaggtcagga 
tcaaaattag 
caggagaatc 
actccagcct 
aacaaaaaaa 
gtggttcaca 
gcctctctgc 
gaagattgct 
cttagaaatg 
tcccacccaa 
aagccagaca 
ctgcccagag 
ggggaatgag 
cagggctggg 
ccccggggcc 
aggctttcaa 
ttgagacgga 
tgcgcctcgg 
gcctcccgag 
ttagtacaga 
cgcccaggcc 
tgtgttttcc 
tccacccctc 
cttggcctcc 
ctctgtaaaa 
ggctgcggct 
ctgcYgaggc 
tcaaaattca 
agtttccatt 
acctccctcc 
cgctcttctc 
agggcgcaat 
cttgaggtca 
aatacaaaaa 
gaggtaggag 
ctgcactcca 
ctgaaacgtc 
ccccttcttg 
tcaagagcat 
cttgctggga 
acacgctggc 
caggaaagga 
ccccaggaca 
aggcacgctc 
gagctgctca 
gaaagtgtaa 
ataccaccca 
atacaattag 



aaaaatggca 
tttttttttc 
aaagaatgag 
ctatggtcat 
gtccgtttcc 
aatcccagtg 
tgtgtaatac 
attttattta 
gaggcgtgat 
gggcctcccg 
ttttagtaaa 
gtgatccgcc 
ggtcaataag 
tgtacctttc 
gccaggatag 
tcacgcctgt 
gtttgagacc 
ccgggcatgg 
gcttgaacct 
gggctactag 
caaaccaaac 
acagcaaagg 
tcYggttgtc 
ggttgccaca 
acacatatta 
ccataaggag 
aatgtggcca 
ccagaaggat 
gaggagccgg 
gccagaggaa 
attccgttct 
catggaaggt 
gtcttgctct 
caaaatctcg 
tagttgagat 
cggggtttca 
tcccaaagtg 
tggcactcca 
cggcgctgct 
ctggctcctt 
ctgctgacgt 
gcgcagcgcc 
tgttggcaac 
gcacgaggct 
ctgacccatg 
cagccacaag 
cccaaccttc 
ggcttacgcc 
ggagtttgag 
ttagctgggc 
aatcgcttgt 
gcctgggcaa 
acttcctcga 
ctggacgggt 
ttcctaacag 
cgctgctcac 
cttggtgaga 
gacatctcct 
gtggacatct 
aagaaacgtt 
tcagccccat 
gcatacaggc 
gggtttgaaa 
gcagagcacg 



gatgttaatg 

cccttactat 
agtttgggtg 
gacttagagg 
ctgagtttca 
ctaggatggg 
ctgacacgta 
ttttattttt 
ttcgacttac 
agtagctggg 
gacagggttt 
tgcctcagcc 
tgatactcat 
atagtgttct 
tctaggttat 
aatcccagca 
agcctgacca 
tggtgcatgc 
gggaggtgga 
agcgaaactc 
aaacaaacaa 
ctacttcttg 
ttccctcctg 
gtagaggaaa 
tttccattca 
accaggaaga 
acagccccag 
ggaggatcag 
ggtgcctggt 
ggctgggcgc 
gcaggcaaac 
ggagctgcct 
gtcacccagg 
gctgacttgc 
tacaggcgcg 
tcatgttggc 
ctgggattac 
cttcatgagc 
ggattcctga 
cctggtgcag 
caccagcagc 
gacagcacga 
acgcagggct 
tggctagggg 
ctagccgtct 
tctttgaata 
ttcatcatcc 
tgtaatccca 
accagtctgg 
gtggtggcgg 
acccgggagg 
cagagcgaga 
ctcctcaaat 
tggcaatgct 
cgaacagagg 
acactttgtg 
aagcctctgt 
tccttctctg 
gtttgccaat 
caaggcatga 
tttcttccta 
ataggtagta 
agatacagaa 
gtggctcatg 



tttttcctgt 
gcctagtcct 
acagagatcc 
ctatattaag 
ctgcctcaga 
gattaatgag 
gtaggtgctc 
ttgagaccga 
tgcaacctct 
ataacaggcg 
caccatgttg 
tcccaaagtg 
tgttaataac 
acaaacagct 
gctgcagtaa 
ctttgggagg 
acgtggtgaa 
ctgtagtcct 
gcttgcagtg 
tgtctcaaaa 
aaaccaccaa 
ctcacgctac 
gactcagtat 
agggtcttat 
aaactctttt 
ataatcccac 
tgactaccac 
catggcggta 
atggagggcc 
ctttctcatc 
aagccagtcc 
ggacgctgtg 
ctggagtgca 
ctcccgggtt 
caccaccacg 
cacKctggtc 
aggaataagc 
aaaactgtca 
acctattcac 
acatttactc 
ggctccgacc 
tgcgtgtgcc 
cgcagtgcgc 
agcttgagtg 
ttcagattct 
tgcggcctgc 
cttggccgat 
gcacttaggg 
ccaacatggt 
gcacctgtac 
cagaggttgc 
tcctgtctca 
ttagtcaagc 
ggaggatctt 
tttcagaatc 
atcctcagag 
gggtttcctg 
tttgccatgg 
gcagacctcg 
cagctaccca 
atattattta 
ggtaaactag 
ggcaaaagaa 
cctgtaattc 



taagagataa 
gctttttata 
ttacatttga 
accaYttaga 
ctaatcatac 
aaaacatttg 
aataagtgat 
gtcttactct 
acctcctggg 
tgtgccacca 
gccaggctgg 
ctgagattac 
aattctagtt 
tcctgatttc 
aaaacaagcc 
ccgaggcggg 
cccctgtctc 
agctagttgg 
agccaagatc 
aacaaaacaa 
aaagcaagcc 
atgtccatca 
catggagcaa 
atcagcattc 
gtccagaact 
tgtgcatctg 
cacagtgctt 
ggaggaaggg 
atccatcctt 
tggacccaga 
cgctagaatg 
ttttttttct 
gtggcatgat 
caagcgattc 
cccggctaat 
ctgacctcaa 
caccgcgccc 
cagagtagta 
ttcttctgaa 
ttcctgctcc 
gcctgtcctc 
gacagcgcca 
ccgcttttct 
gctgacactg 
agaactcagt 
agctgggctc 
cttgaagact 
aggccgaggc 
ggaactttgt 
tcccagctac 
tgtgagccag 
aaggaaaaaa 
ccccgtgcta 
gaaagaggaa 
ctcatgagca 
gtgagatgca 
tttatcccaa 
aggactcaca 
tgctcactgg 
gaaggcctct 
aaaatttaaa 
aattcactct 
actgcattta 
cagcactttg 
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12001 
12061 
12121 
12181 
12241 
12301 
12361 
12421 
12481 
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12841 
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12961 
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13081 
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13561 
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13681 
13741 
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13861 
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15661 
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15781 



ggaggccgag 
gtgaaacccc 
atcccagtta 
ggctgcaatg 
gtctagaaaa 
gctatgcact 
gcatgtatat 
catcaaataa 
cacttctatc 
tccagttcct 
ccactactgc 
ctgggggatc 
agagggaaag 
ccactccaac 
aatcccaaag 
ttcctccctc 
ctttcctccc 
tccctccttt 
ttcctctccc 
tcttccttcc 
tccttttctt 
tctccctcct 
ttccttccct 
tccttccttc 
tctctttctt 
actctgtcac 
ctcgggttga 
ccacaatgcc 
aggctggtct 
ggattacagc 
gattctattc 
gcttcgacaa 
tttggctttc 
tagctccagg 
aaagctggga 
ccacacaacc 
tcttccctca 
acctgctcag 
ttccctgcag 
ttgtatctca 
cagcactttg 
atccatgacc 
gtggtgcatg 
tggcaagcgg 
gagcaaaact 
gcagcatcct 
cacattaatg 
aggtctccag 
aaatgcacgt 
tgggaaccca 
gggaacaagg 
cttggacaag 
acttctttct 
taccccacaa 
cgccaacaac 
acacatactc 
acgcctgtaa 
ttgagaccag 
gggcgtggtg 
ttgaacccgg 
gcaacaagag 
aataaattgg 
gacagcccga 
gggtttccag 



gaaggcggat 
gtctctacaa 
tttgctgggc 
agctgtgttc 
aataaaaaaa 
aaatctgacc 
cgtttaaata 
tctataattc 
agtatccatc 
gcctggcctc 
tgagagcatc 
agcagaagag 
tgacttacct 
tcagtctcat 
ttgtttcctt 
tccctccttt 
ttcctctccc 
tcttccttcc 
tccttttctt 
tctccctcct 
ccttcctctc 
ttccttcctt 
ccctcccttg 
ttcccttcct 
tttccccctc 
caaggctaga 
agggagcctc 
tggctaattt 
caaactcctg 
catgagccac 
catccccaac 
acacagaggt 
tctcttttac 
agatcctgag 
tcatgctgca 
ccctcagtgt 
tcacctcaca 
tctcccatca 
gaaagaggag 
aaagaaatcc 
ggaggctgag 
atcatggaga 
cctgtaatcc 
aggttgtggt 
ctgtctcaaa 
aggggtagcc 
atgccaccca 
tcttccatgt 
ggcccacggc 
tgctgagcct 
actccaaaag 
ttacctaaac 
ttaaaggctt 
ggatcacagg 
ccaaggaccc 
tcaagggaaa 
tcttagcatt 
cctggccaac 
gcacacactt 
gaggcaaagg 
cgaaactctg 
ctggttctat 
gtccagccat 
gacgacacac 



tacctgagct 
acatacaaca 
tgggctggga 
acgccactgc 
ttagcagcct 
agccattgtg 
gttactttat 
tagaatcttc 
ctgactggac 
tgctggcctc 
ccctctacag 
actctaagcc 
agggacacac 
tcccactctg 
cctccctccc 
ccttccttcc 
tcctttcatt 
tctccctcct 
ccttcctctc 
ttcctccctt 
cctcctttcc 
ccttcctccc 
ccctccctcc 
ccctctctct 
tctctctttc 
gaacagcagt 
ccacctcagc 
tttgtatttt 
ggctcaagca 
cgcgcccggc 
taattgtccc 
ctaagctcac 
tttttttaaa 
aacatgtccc 
aacaccactc 
gggccatctt 
ggcctgaaca 
caggaccctg 
gtgctcagag 
tcttcctgtg 
gcaggcggat 
aaccccgtct 
cagctactca 
gagccgagat 
aaaaaaaaaa 
tagacggggg 
agtccattcc 
ctgacctcta 
tgctgccttc 
ggagaatctt 
agccgtcgga 
ctcctgcatc 
tactttgagg 
ccagtgagat 
ctgttactga 
cagaaatcac 
ttgggaggca 
atggtgaaac 
gtaatcccag 
ttgcagtgag 
tctcaaaaaa 
tctcaacctc 
gacccactgt 
tggcccgaag 



ctggagtttg 
attagtgagt 
tgggaggact 
actccagcct 
ataattccaa 
tgtaaggcgt 
ttaccaacat 
ttaaacatga 
ccaagtggac 
tgggtttgca 
tactcagaac 
catctatcag 
agctcttagt 
agatttaaga 
tccctccctt 
tctccctcct 
ccttcctctc 
tttcttcctt 
cctccttttc 
cctctccctc 
ttccttcctc 
tccctccctc 
ctcccttctt 
ccttccttcc 
tttctttctt 
gtgatcatgg 
ttccccagta 
ttgtagagat 
gtcctcttgc 
caatcgcaaa 
caaccactgg 
actctcaaag 
atttattttc 
ctacagtttg 
tgcaacgtat 
tccctttcag 
gagggccaga 
gacaagtcac 
acaaggtgga 
gccgggcgcg 
cacctgaggt 
ctactaaaaa 
ggaggctgag 
cgcgccattg 
aaaatcctct 
cagattcctg 
catggcacca 
cctcctttgc 
tgaggtcacc 
cactctccca 
taaacctggg 
tcggtttctt 
atgaggtgag 
ccagctatta 
gctctaacca 
acagaagagg 
gaggcgggtg 
cctgtctcta 
ctactcagga 
ccaagatcgt 
aaaaaaaaaa 
agctccgctg 
ccagcctcct 
tcttactccc 



agaccagcct 
cttggtggta 
gtttgggctc 
gggtgacaaa 
agcagaaaac 
gtgtgtgcaa 
ttacatttaa 
ggtgggcaac 
agcttgagtt 
atcccaactc 
ctcactggct 
gtaaaaaacc 
aaacagagtg 
aaaaaaaatg 
cctccctctc 
ttcctccctt 
cctccttttc 
cctctccctc 
ttccttcctc 
cttttcttcc 
tccctccttt 
cctctctccc 
tccttccttc 
tctctttctc 
ttgacagggt 
ctcactgtag 
gctggaacca 
ggggtttcac 
cttggcctcc 
gtttctaaag 
tagcagcctg 
ttacactcac 
tatttgtttt 
gcttttaatc 
ccttcccacc 
gacacacagc 
gaggtggagt 
tcctctctca 
tgaaacaatt 
gtggctcttg 
caggagatcc 
tacaaaatta 
gcaggagaat 
cactccagcc 
tcccctgtca 
ggggtggttc 
ggtcactggg 
ttcctccaaa 
tggccatacc 
cctctccctc 
tttggatcca 
cttcgtcaaa 
agggaagcca 
tcacctctgg 
gaccctccct 
ccaggcaggc 
gatcacctga 
ctaaaagcac 
ggctgaggca 
gccactgcac 
gaaagaaatc 
ctccactggc 
ttcctgcagg 
agtcgccact 



gggcaacatg 
catgccagtc 
aggaggtcaa 
gcaagaccat 
cagctggctg 
atgtgtgttt 
ggaaatttca 
atgagacccc 
tccagttcaa 
cctaccaccg 
gcctgcaacc 
tgaggcccag 
gaggccaact 
tgattttaaa 
cctcctttcc 
cctctccctc 
ttccttcctc 
cttttcttcc 
tccctccttt 
ttcctctccc 
ccttccttcc 
tccattcccc 
ctttgtttct 
tctctctctc 
ctcagtcctc 
cttctacctc 
cagtcacgag 
gttgttgccc 
ggaagtgctg 
taggaaatgg 
gcttgtctct 
atgtctacag 
ccccatgaca 
ttatgattgc 
aacccagcgc 
acagtccctt 
cctgcaattt 
gagcctcagt 
tctgggcttc 
cctttaatcc 
ccgtcaggag 
gctgggcatg 
cgcttgaacc 
taggcaacaa 
ggccttagta 
taacccacat 
acaaagggcc 
gtgcccagag 
ccaagtagtc 
cattcctcag 
ggctttctac 
tggggacaat 
tcctcctccg 
tgatgctggt 
ctagtatcca 
gcagtggctc 
ggtcaggagt 
aaaattagcc 
ggagaatcgc 
tcaagcctgg 
acatagagga 
taactcagag 
gaaccagctt 
gcactcctgc 
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19621 



ctgggcaaca 
ggactctggg 
ggacacatat 
ttcaaagtat 
agccatgact 
ctttgagctc 
aaggcacgaa 
atgaatcatc 
acctctcatt 
gacttcagct 
gcgKcattta 
aattcagaaa 
gggtacagct 
gctccctcgg 
aacattcctc 
atctcagtct 
ctgtatttaa 
caggtctaca 
ctgtggaaac 
ttccatgagc 
gtgtcccatt 
tgcttctgaa 
acggatccac 
gctgtccccc 
catttcctct 
gccccctgcc 
ggagtttgag 
cttacctttt 
aatattagat 
aatcttatcc 
cttctttgct 
ggggacagaa 
tcccaccgtt 
gacaccactt 
gccccaatct 
ttggggtgac 
cacacctaag 
ggtctatcac 
tgttttactc 
ctcacataga 
ttgccccagt 
ggttcccaaa 
gagggttctg 
gtggtcgcca 
cgatgaattc 
caggcctgga 
gccacaggat 
cagacgctgg 
gaagggacac 
ggaaggggaa 
agtagggagt 
aatgcggacc 
taagggaggt 
gggggcttcc 
gtgacagtgc 
ggaaattgga 
gagtgtagct 
taagcctgtt 
gtgagctttc 
tttttaagat 
ctatgttcta 
gttgcagacc 
ggtggacaga 
agcttgtgga 



acagtgaaac 
aactcacggg 
agagacacag 
tagtagagca 
ggaagcctgt 
cacacaggac 
gaagaagggg 
tgtttctcca 
aaactctttt 
ttgtgatctt 
aatattcaga 
acactggagc 
cggcttaggt 
cctttgagct 
cctgcagcct 
tggatggggt 
actttgtggc 
gtattgagag 
aaatccaccc 
cttacatcag 
ttctctggac 
agtacaggcc 
ttttgacgca 
atggtcctta 
atcgtggccc 
agacccagca 
gtgcgtggct 
cttcacagga 
tttcctagtg 
Ytccctgatc 
tctgggcttt 
cagcacttga 
ccctggggta 
gttggggaaa 
ggttccttcc 
tgatctattg 
ggtccctagc 
acgatgggag 
accaaagatt 
actgaactga 
cctggctgct 
tcctagcccg 
ggagcagctt 
ggcacaagga 
tcttctaacc 
agcgaggtga 
ttttccaatc 
tagagagacc 
cagatggggt 
agccttaatc 
gaaaagcaca 
ccatgactgc 
gggcaggggg 
gaaaaggccc 
cttctttact 
ggacctgaaa 
cgttccctac 
tttaaaaaat 
tctctggatc 
ttttcatttg 
cttgcatcca 
acaaagcctc 
gcaatagtga 
aaggggacaa 



tccatctcaa 
gaaggttgga 
agaaggagca 
caaggaagag 
gtcccatggc 
aggactctga 
tgtcaatgag 
cttccattct 
tgcagcagca 
tgggtcagac 
agagctgagg 
aatagacttg 
gacccatcgt 
ttcaagcctc 
gctcctcaga 
tagcagtgtt 
gtaggcatca 
acagaaagtc 
tccttacttc 
actttgtcct 
ctctttcatc 
actcctgcat 
gtctgttctg 
tcaaactccc 
tgaatggcag 
gcaattccat 
tgattttcgc 
aatcccatca 
attcttttct 
ctctgagaaa 
taataatccc 
ctgataaagg 
ccccctggag 
ggcagctcga 
ttgatggagg 
ctgcttaggc 
tcttgtagaa 
aaagctggca 
aggaagcctg 
atagaaatca 
gcaccctcac 
tggatataca 
tgttcctggg 
aagaaagaag 
ctggtgcaga 
ggctcctccc 
agccatggga 
tgagacagag 
ctgaggaaga 
tgcttctcaa 
ggtggggctt 
agagcctagg 
cagaggcccc 
aggttagggg 
ctgtcctcta 
gaaaaccctt 
cgctgagagc 
gtgcacgcac 
attcaaatat 
tttttgtttt 
aagggtccta 
aggataattg 
acccttctca 
ggaatgctgg 



aaaaaaaaaa 
aaggccaggt 
tcgtaggggg 
gctgggacag 
attgtcatct 
ctcatctttg 
tgcttcataa 
gtggtctccR 
ctttctctta 
accatgatga 
ctccgcaggc 
ttgaattcaa 
ccccatcagc 
agcctcgtgt 
cgcttatctc 
ttctggatca 
gtttccctaa 
aagaagttct 
ctggagaggg 
gaactccacc 
tccagtccca 
cccttctata 
ctctcccttc 
atctacagcc 
tcacatgtac 
gcccctgtct 
ccatcattct 
ttcaatccac 
gatagtaagt 
ccagactcct 
cctgagatta 
cccccatcct 
gccagcacac 
gcatcccatg 
acgacattaa 
caagtctaac 
caccacagag 
agaagaaaac 
aataacaaca 
tctcatttta 
tacgagccct 
tttcttatcc 
aatctgcagg 
ggtctcgctt 
aattctccta 
caccagcatc 
ccagggcagc 
ggcaggtgaa 
aagaaggaag 
tcgcgaagta 
ttgaggctca 
ctctgcaggg 
tttcccccac 
gcctgcctgc 
gccccNggga 
ggaggcagca 
cctgaggctt 
gcccacccac 
atgcctcYac 
tgcatttgct 
cctaatagaa 
gtacaggtag 
cccccctcat 
gacatgaagc 



aaaaaaaaga 
atgcgggatg 
tatggggaag 
ggggaggatg 
gtgtgtgtgc 
ttatagatct 
atgaatgatt 
gccaacattc 
gttcttcctg 
gagatgcaga 
tgaattctga 
actcgaggtg 
tgctccttcc 
gttgcttcag 
accccatagg 
aagtagcttc 
tgcctctgtt 
ggacctgcca 
gcaattcctt 
cacagctccc 
gaccagcatc 
cctgggtcgc 
tcaaatctgc 
tctagaaagt 
ccttcatctg 
ttctggatct 
gtcctgtctg 
atgttgcttt 
cccaaatctg 
agatacggac 
ttcaccagct 
cctggtctgt 
gacctctcca 
gacatccgtc 
aagaagacaa 
cattgaaNgt 
attttccagt 
aagctgtggg 
tcacctcttt 
caatcatcag 
ggattgccac 
taagagtcac 
caccttcctg 
attacagtag 
atgagtcctg 
taaaaagccg 
ccaccagcct 
tggagcggga 
aaaaaaagat 
ggtctgcttc 
ggcactccac 
agcagctagg 
tgctcccacg 
tcagctcaca 
cccatctgag 
ggcagtggag 
tctgggtccc 
ccaaggattt 
tcttgaggta 
tttgtcttgg 
aaattggcac 
aatcatgttt 
tctggggagc 
aggaccactc 



atgatacaac 
gtattggcaa 
ctatgcacat 
aacttggaga 
cccatcttga 
cagtgagtgc 
ggatgaatga 
cctcccattt 
tcatttgcta 
tcgcaactga 
atatgcagat 
gagcagacga 
caaagctacg 
tggcagctgg 
gccctgctcc 
tttccttgtg 
atccatcttg 
ggaattccca 
caggaaggtc 
ctgtctgagc 
ctcctgcatg 
caggaaactc 
ccagatctca 
gacttctgat 
tttggctctt 
cagctcgaga 
gcttgcatgc 
tccatgtacc 
tcttttagcc 
ctttcaaatg 
aaaggatggt 
cactctcaca 
gatccctgtt 
tgaaaggttg 
aggagagaat 
ctccaaatct 
tcaaggtttt 
aatcaacaat 
tgagtacagc 
acatctgtct 
gtcgtccctt 
cctactatgt 
ccttcctgct 
ggaaaacatg 
gggtgaaacg 
gctccctgga 
ccccagctgc 
gaggaggaga 
accttgatgg 
tcaatcatgc 
ccaccaggac 
gagacactag 
gggatgagga 
tccccctggt 
tctctcccct 
ctgaggcctt 
tggtcttccc 
ctttaattct 
gcgggaagat 
ttttgctttg 
caaagggttg 
tgtgttgaga 
acctgctctc 
actttggaca 
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19681 
19741 
19801 
19861 
19921 
19981 
20041 
20101 
20161 
20221 
20281 
20341 
20401 
20461 
20521 
20581 
20641 
20701 
20761 
20821 
20881 
20941 
21001 
21061 
21121 
21181 
21241 
21301 
21361 
21421 
21481 
21541 
21601 
21661 
21721 
21781 
21841 
21901 
21961 
22021 
22081 
22141 
22201 
22261 
22321 
22381 
22441 
22501 
22561 
22621 
22681 
22741 
22801 
22861 
22921 
22981 
23041 
23101 
23161 
23221 
23281 
23341 
23401 
23461 



aaggcctttg 
tgactctcct 
aatttcatcc 
gtgctagtac 
aagttgttct 
ggaggctctg 
tcagctcctg 
cagtttctct 
ccttataagg 
tctccagatc 
atcacaagtc 
aagatgtatt 
aagccgcatc 
tcttcctgtc 
ggccttgaga 
cgctgtcaaa 
tcggcaagga 
gtttgctcca 
actaaatgct 
aagaaggagg 
tcctgcccac 
gtcaatggat 
gggtgggaga 
agagtgcggg 
gataggaaca 
aatctggtga 
aattcttagg 
tattcgttta 
aaattcctgg 
cccacggacg 
tgcctaaata 
cagagatttc 
cctgttcacc 
cgcagaacaa 
tgaagcaaat 
cagaggtacc 
ccagaagaac 
agacagtgag 
cggccacagc 
tccccgtctc 
caccctgggc 
gccagcctag 
ctattctctt 
gggaaacccc 
ctgggagttc 
gaaggaagct 
aaaagaccat 
caaattgtgc 
aacgcctctc 
tggaaggagc 
ttagctggaa 
tagggaagag 
cccatgtgaa 
ttgagactgg 
tgcaacctct 
attacaggca 
tttcaccatg 
gtctcccaat 
taaacaaggg 
acattttcag 
aacacggacc 
atgacagcct 
gggagctgcc 
tgtaatagcc 



acatttctcc 
atgaaaaaca 
tcactagatc 
ctctcctgct 
cttccagtag 
ggagaagcaa 
gcttctccct 
ccccttctat 
acccttgtgg 
cttgactgga 
ctatgattaR 
cagaaaattg 
taaggatgct 
tgccgctgcc 
gtgaactgca 
gctccgaaga 
agaggaatca 
gcagcaggct 
gatcacttct 
actcaaagtg 
Rctgtttcct 
aaccaggaac 
aattccaaga 
cgccaggtca 
aggttccaaa 
gtggatacct 
cctgggtcag 
cctttgatta 
gtggataact 
gggactattt 
ttttactcaa 
tgccatcatc 
tctccagtgt 
agtaggaagc 
aaacacagcc 
aactctgttt 
cccacgtgtg 
acggtatcga 
gctgcccYgc 
tattcccctc 
tttgcctcac 
gtcttgtcca 
gggcattaca 
acaaaactag 
attccaccca 
aactccaatt 
gagggctgaa 
tgtgctttcc 
aaggacggtt 
tgcatcctcg 
tttcaacatt 
gaacatacat 
tgctcattgg 
gtctcactgt 
gcctcccagg 
cccgtcacca 
ttgaccaggc 
gtgctgggat 
aagaagtcac 
aatgggctca 
ccactcatca 
taggtcccat 
cttaggatgt 
agaattcatg 



tgaaggcgtc 
aattcagaaa 
ttctaagcgc 
gtaacaaagc 
agttctaaaa 
ttcccttacc 
tcgtcttcaa 
catctcatct 
ataccttggg 
tcacacctgt 
gacacccaca 
caatagggat 
tccgaggtgc 
tgcctctctg 
gaatggaaag 
gctgaagacc 
tcatttggWt 
gggattccag 
ttgtgacata 
tgtaggaggc 
gccagggccc 
tctctataat 
ggccgactgt 
cgcaccagga 
atagagctgg 
gtcacaggtc 
agggggagat 
agtccatcag 
ggaaagaata 
cagtgacata 
agtcaacacc 
aagatttaaa 
ggccagtcgt 
tcatcaatgg 
tgcagctctc 
tcttagggca 
taggaggctg 
aggctgcgag 
cacctgcgga 
ccgggcagtc 
ttgctccaga 
gaagccctgg 
tagaaggtgg 
gaagaggcga 
gaggcctaaa 
cgcaaaatga 
ttcaggtgct 
gcctggccca 
cctgaagagc 
gaccaggatc 
ttagaaggaa 
agagctttca 
aagatgtcca 
atcgcccagg 
ttcaagcaat 
tgtctggtgg 
tggtctccaa 
tacaggcgtg 
tctttaaggc 
ttaaccaact 
aggctgacca 
ggaaatggcc 
gatatgtttt 
gagctgggca 



caataaattg 
ccgtttctat 
acatgagggc 
cctacagaca 
tcaaggcatg 
ttttccagct 
gccagccttg 
ccttttctcc 
cccacccaga 
gaggtccctt 
tctgtcaggg 
acctgagggg 
agctcccgct 
tttctagaca 
gagtgagatg 
atttatttgc 
tccttgccat 
ctgtgcccag 
aaaagccaga 
tgaaatgcta 
aaaagatttc 
aggggagggg 
aggggaggca 
gggcgcatga 
aagacagtcc 
atcaggctgg 
tatagggcac 
ggtcccaccc 
ctcttggaag 
Ygccagtgga 
aacttcatgg 
gggagcagaa 
caacagggaa 
aagttgccat 
agtctgtatc 
gaaaacaaca 
gaaattggct 
cacaagagcc 
cgtcatctct 
ctgctgctct 
ttcaggatcc 
atgccagggt 
gaccagtctt 
atagatgcca 
atgttgacat 
ctaaatagag 
tacacagagt 
cctgctcccc 
aatggagcac 
tgcattgttt 
taaatctgga 
gaacaacggt 
ccatcaaaaa 
ctggagtgca 
tctcgtgcct 
ctagtttttg 
ctcctgactt 
agtcaccatg 
tcaaatctcc 
tgcctaatgt 
cttggcaccc 
caaaggggac 
gaatccatac 
gtcagaggtt 



tggctgttga 
aggtttctgg 
cctgatgggg 
cagtgacttg 
ggtagagctg 
tttagaggca 
tagcatcctc 
ctctcaccct 
taatcgcaga 
ttatgatgga 
gctgttattt 
aattgaagga 
ggaggactct 
gccagtgttc 
ctgcagcagg 
atggtggggt 
ggtcatcctc 
tgttagccat 
aattagttgg 
gatttgattt 
ccgctccttt 
ctgggactct 
ggtaactgcc 
tcatggctca 
actgacttta 
atagtccctg 
aagtaattag 
tcacctggtc 
cttcagattg 
ggccattggt 
aggagttaca 
gcagtgattc 
acaactgttt 
tcctaagttg 
aatcaggctt 
tgaaattcac 
tggaagataa 
ctgctttgtg 
gctggtgcag 
cactgagaat 
catgtggaag 
gggggtggac 
tctctatgac 
atcagctccc 
gccgtctttg 
ataaatgact 
catgagtact 
ccaggtctgg 
acctcacagt 
cctgggtaac 
ggcttggtga 
agaaStaaaa 
agactctttt 
gtggtgcaat 
cagcctccaa 
tatttctagt 
caactgatcc 
cctggctcag 
tctgttcccc 
ggagctctat 
agtgaatctc 
tcagctccat 
ccagaatttt 
aagatgaggg 



tatttttggc 
ttacatgatg 
acagagtggt 
aaccaacaca 
cattccttgg 
cctgctgtcc 
tgatctctct 
actgtctccg 
taacctctca 
aggtgacata 
agcctaacac 
tcccaactcc 
gctccatctc 
aagtcagtat 
agaaagagct 
ctgacggagg 
agctggcacg 
gccaccctca 
catgatgaag 
ctcaagcact 
aacgagaaga 
ggtctcttta 
agaagaaaag 
agggacattt 
caaaaccccc 
cttgtcaccc 
tgaatatcta 
atcatttttc 
ttcacccaga 
tttacatacc 
gaggcagtta 
ctgtgactca 
gttttccaac 
ttcttgttaa 
cttggtcgag 
tggaatgata 
atagaagcca 
aggagtgaca 
gactcggtcg 
ggattctaag 
cagccaatta 
agagagtcat 
tccacatgct 
tcccaggaaa 
agtgtagagg 
gacagctcag 
cagcatctct 
cctgctgtgg 
ccaggctgcc 
aactaaggaa 
caaagatttt 
aaaaatctgt 
tttttttttt 
ctcggctcac 
agtagctggg 
agaaataggg 
actcgccttg 
aggctgttaa 
aggcagcctg 
gtaaccacac 
aaccccggaa 
atcagaggct 
ccccatttcc 
tgatagctct 
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23521 
23581 
23641 
23701 
23761 
23821 
23881 
23941 
24001 
24061 
24121 
24181 
24241 
24301 
24361 
24421 
24481 
24541 
24601 
24661 
24721 
24781 
24841 
24901 
24961 
25021 
25081 
25141 
25201 
25261 
25321 
25381 
25441 
25501 
25561 
25621 
25681 
25741 
25801 
25861 
25921 
25981 
26041 
26101 
26161 
26221 
26281 
26341 
26401 
26461 
26521 
26581 
26641 
26701 
26761 
26821 
26881 
26941 
27001 
27061 
27121 
27181 
27241 
27301 



atggttaccc 
cagactttta 
ccatagctgg 
ataatcatgc 
ggaattgagg 
gtaaatgtga 
accacgagaa 
ctgatggtgg 
acaaatatac 
gatcgtgtgc 
tgtgtgtggc 
ggttcatagc 
tagcaagYga 
tcccattagg 
ttagttcaag 
tcagagacac 
gggcaaacca 
gattttgttc 
tttgtttaag 
actagcagta 
tgtcagggca 
caaagcttaa 
acaRacaaat 
cttcttcagg 
gttacaggga 
ctgggtgcac 
ctcttgcccc 
aaatacagat 
ggcctccctc 
ggaggtctcc 
ctgtccttct 
aacaaatcct 
agcagcctac 
cattgaagac 
cagtgtggag 
aagcttttgc 
tggatgaggg 
cctctatttc 
cttgtgtgac 
gggccaagcc 
gtcgcccaga 
tgagtccctg 
ccatccatca 
agaacacaag 
ccagctgatg 
tctcctcaat 
ttccttgttt 
tggagtgcat 
cgcctgcctc 
tttttgtatt 
tgacctcagg 
accgtgccca 
tggagtttca 
atttgtcaag 
acaaaaactt 
tgactcgaat 
ctcagcagga 
ttttctgtct 
gcatactctc 
ccatggctca 
aaaggggatc 
tgactgagat 
gaccatcacc 
gtggagccgc 



acaataaacc 
ggtctggaag 
cttctcatgc 
atcatgttcc 
gatcccctag 
ataggaaatt 
tgactttgcc 
cagcaatggt 
acacacaaaa 
tgaggccctc 
tctcactggg 
agataaggag 
ttgactgtat 
gtttgtgtct 
aagagtcact 
taggaatcaa 
tatattcctg 
ttccttccat 
acagatagtt 
cagtggaatt 
aatgctggct 
gtgaaatgta 
gaaacacaat 
aagaaataga 
ctgaggcaag 
tttacgagtg 
tgctcYgggc 
aattaatgtc 
tctcctccct 
agtcgcccac 
ctgtgtcact 
ctgcctggga 
attagacagc 
atggaagcat 
gatgggcagg 
aggagtctgg 
tgtttgcaga 
tcatggactt 
ccatgtcatt 
agatgtctac 
aatgaccgac 
gaagaaggat 
gcaccacaaa 
aggagagagc 
aggaaactga 
gcaccacaaa 
ttttttgggg 
tggcgccatc 
agcctcctta 
tttagtagag 
tgatccaccc 
gcctcctttt 
ttggaatgtc 
tcctcctgct 
cagacaggaa 
aagtccagct 
tggctctaga 
ctttgcctgc 
caggatgttt 
ttacctggcc 
catcacacta 
acacttccag 
agggaaagac 
tgtttcttct 



actttcaaaa 
tttcagtatc 
tccctgatgt 
tccagggaag 
gattgctcct 
tcaccaactt 
cataagtaag 
cagtagtggc 
attctgagac 
ctcctgtgac 
gaagaggcaa 
agttcttttt 
acaatgagca 
ctgttaccca 
aacccaagct 
agtgtgtgga 
atgttggtaa 
cctctgacct 
tctgtttctg 
gcatcaaatg 
ggatcactga 
tgaaaagaat 
gttctggcgt 
tgaatcaact 
tgttctgcag 
acttaggaga 
tccagctgag 
cactggggca 
gtagaggaac 
atgggtgacc 
cctMccagta 
gccttgccta 
atgaaggctc 
gggcttgaWg 
gatagtggca 
gtgaaatgag 
ctggatgagg 
tctctcaacc 
cgtgggccct 
aagctttccc 
cagaagaaca 
catgaggaag 
ggcacaggca 
caggtggacc 
ggctcagaga 
gttccttgat 
gggggacgcg 
tcggctcact 
gtagctaaga 
aaggggtttc 
gcctcggcct 
atttttcttc 
atcgttccct 
gtctgctgac 
cttgctccca 
tggtgagtag 
gatccggcct 
tgtatatgag 
gggtttttag 
ttcagggttc 
caaccaccac 
gaggacaaga 
ttcattcttg 
gctgtatcca 



attttgttta 
taaggttgac 
agggatagca 
ctcatgttgt 
tgtaatactc 
actgaggcca 
aggaacattg 
tgtagccttg 
agaagctgcc 
ataacttcag 
gtgcatttgc 
gtttgtatga 
caaattcagg 
tccccattcc 
attctgtcta 
gctaagccag 
tggtctgctc 
gctctagttt 
ttgcttgcaa 
caggtagcct 
agaagtaaat 
gggaattatc 
taggcaagtc 
gaaggtcagg 
gatgctacat 
ggttctgggg 
gcgcccagca 
aggccatggg 
attctgaaga 
ctatcaagaa 
ccccagtcct 
gggctctccc 
tgggcttttg 
aaatctRtgt 
agggctgtat 
agggctgact 
gtgtttgctg 
tacacacagg 
ggctgctaga 
tgagtgcctc 
agcagaaggg 
tgaagtcagg 
tagatcagtg 
tgagatatca 
ggcggggtct 
tggttcctgg 
ggggatggag 
gcaacctcct 
ttacaggcac 
actgtgttgt 
cccaaagtgc 
acaaatcaat 
cagtgtatcc 
atcatcacca 
aagcgagaaa 
gtgagtttat 
cccccagtct 
taatgagact 
agacacctgg 
aagcagggga 
ctccaccgcc 
cagagtggat 
gaaggacatc 
aaagttctaa 



tgatacatac 
aaaagttgtt 
atgacgtcac 
tattacacag 
taaagtgtta 
gaccatcaag 
agtggaacaa 
accgtacctc 
aatggaccaa 
catgcagaag 
ctttttcctt 
gaagaagagt 
tggctgtttg 
tgataccata 
ctgagtatgt 
gtgcaggcag 
ctgtccttcc 
tcatccaatg 
ttgaaaaatt 
gatggcattt 
tactcagtgY 
caggaattga 
ggtggccgtt 
gaagccttgg 
aaaaaaggac 
tgattKtgtg 
gttctatatt 
ggacaggagc 
cgctccttga 
tttctcccRg 
gctcctagaa 
ttcaggggaa 
tgctaagagt 
tcttacgggg 
aacagggaag 
ggatgagggt 
tctggctatc 
cccctctgat 
tggggaccat 
acaggccact 
gccatggagg 
caccgagagc 
ttgcttgaac 
tcttgtccag 
cttgactgca 
catctggttt 
tctcgctctg 
cctcttgagt 
ccaccaccat 
cctggctggt 
caggattaca 
gacttttcca 
tggccccaag 
tcacctttac 
ggagccagga 
taggacttac 
ctaaactgct 
gtttttcttg 
tcctcagctg 
catatacccc 
atcatcaaga 
gctggaaaga 
gaaccggggg 
ctcttcggct 



tcctaactca 
tcagcagaaa 
tgggttggat 
caggacagga 
gagtattagt 
gagcttagac 
ttcaagactt 
tttattttct 
aagaaaaatt 
gtggagatgc 
gggatggctg 
gtgtgtgcag 
gccagaggct 
gcctcactga 
gaaactttaa 
agccctgaaa 
aaattctgta 
aatcttctat 
tttaaacaat 
caactctaag 
cagaaacatt 
tgaacaattg 
gctctgttga 
ctgatggaga 
taagtaggcc 
cattctgtat 
ctatgcatgg 
agtggttaca 
agactcctca 
catcagcctt 
tcacttccca 
ctcaggccac 
aataggaagc 
caaMttggcc 
accattctgg 
gtttgcagac 
tctctctgtt 
tttagtcagc 
gcctgcactt 
gggcactggg 
cctagctccc 
ccaggggagc 
ccagatctdc 
cccctctttc 
ggctggttca 
tgcttttctc 
tctcccagac 
tcaagcaatc 
gcccagctaa 
ctccaactac 
ggcctgagcc 
atacagaatt 
cagggctgac 
cactgttcag 
gaccaaggaa 
acacagggca 
tttcagttaa 
gtaggttctc 
gggacaatgg 
taaataacct 
agccactggc 
cagggcaggg 
caggtcggta 
ttctgcattt 
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27361 
27421 
27481 
27541 
27601 
27661 
27721 
27781 
27841 
27901 
27961 
28021 
28081 
28141 
28201 
28261 
28321 
28381 
28441 
28501 
28561 
28621 
28681 
28741 
28801 
28861 
28921 
28981 
29041 
29101 
29161 
29221 
29281 
29341 
29401 
29461 
29521 
29581 
29641 
29701 
29761 
29821 
29881 
29941 
30001 
30061 
30121 
30181 
30241 
30301 
30361 
30421 
30481 
30541 
30601 
30661 
30721 
30781 
30841 
30901 
30961 
31021 
31081 
31141 



tcagctcttt 
tccccccaat 
agtcccctct 
ccatctgcat 
ctcagctcca 
tcgtggcctg 
tggcatggat 
tgagcctgcc 
gactcgtata 
aggatccggg 
ccWccacttc 
tgcctgatgg 
agggaaagta 
atgctggttg 
acgatgctgg 
gtaagggcca 
atgccagagg 
acctcttcaa 
ttgacttggg 
tattcgtcct 
gcctcatccc 
aatcagctca 
aaataaagct 
ttttgtatat 
catcttcaga 
gtcaacatca 
agtgggttcg 
ttattagaag 
ggaatgtcct 
tagcaatgtt 
aactgggaac 
ggacagcttg 
actttctcct 
accttccatg 
ccaaggcagt 
cggtcttcac 
aaacccccca 
gttggctgtg 
ctgtaacNca 
taactgggtg 
gagaacacag 
ctgtgtgtct 
tcgtgcaata 
gtgggggctt 
gacattgcat 
gagatcacct 
tgaaggccac 
agtcctggct 
gcttcagggt 
tcacctgtgt 
actttgaaaa 
atagagaaag 
ttacccagtt 
tactctatgt 
catgctgtca 
cagggaggga 
tacMtttcat 
ccttcctgct 
ccctgaaacg 
cctgctgttc 
ggagagatgg 
acagagccaa 
gtcagcagga 
ggacagcatc 



cttttcctgg 
atattcgtta 
cctcccttat 
taacccctcc 
taaactttac 
tgtatggctc 
tcagacgctg 
gcctcagctc 
caaggttgac 
ctcctctgct 
cagctgagat 
cacggatttc 
agttgggtgt 
cagtcagcct 
tggtgatccc 
gactcgtccc 
cagcgccagt 
taccatttgc 
gaaactcttt 
caatagctgc 
tgtaccaata 
ataagatctg 
ttaaatttta 
catagatatg 
tgctctttag 
tccctccaat 
gagcaaaaag 
agaggtcgtt 
gcctggtagg 
atgggtgatg 
taagggtgtt 
tgatccagcc 
tggccaaact 
cccatcccgt 
tagggagggt 
ccgctcctta 
ttctggtgtc 
actccccctc 
atcctgaata 
agcttctctg 
cgtcctgagg 
cttccttagg 
acagttatca 
aagaaacacc 
tttgactaac 
tgcctggcat 
tgggtccagc 
gcacatgaac 
ctcactgtca 
gcatRccaat 
tcatctctat 
tgtttttctc 
ggcagacact 
agaacagaag 
gggaaacagt 
agaaatatcc 
gccctcagtg 
gctcctccaa 
tctctctggg 
attctcaggc 
caggtcccag 
gatccacccc 
ggatcctgct 
tccaggaaca 



ccttctcatt 
gtctaaagga 
tccaggctcc 
cctgctgtgc 
cttgccctct 
acctcccctg 
atagatctga 
ctcagcagat 
cacatccagt 
cactgcccgg 
tcgccaggtg 
actcccaatg 
gatgtcacgc 
gctggcttca 
agtcacagca 
tgctgtaaat 
ggagctggac 
aagggcacga 
caaaaaccat 
atatgttcta 
aaggacagat 
ttctacataa 
aatggaaaaa 
ccacagtact 
tggtaggtat 
agtgtgtgga 
accttcattc 
cttttgtctt 
aacatctttg 
ggttttgggg 
agacctcagg 
ccagaaagag 
ccagtcaggc 
gcttgctggg 
cctccccact 
cctgccttta 
tccagtcaga 
tgtctttact 
aaggcttcct 
gttggcaaca 
acaacgaagc 
ctgattttaa 
gtgagttacg 
ttcaaatctg 
tctgagtcct 
gttctctctt 
cactgccccc 
tctcaagcca 
catgtcRcca 
gccatgtgca 
aaatcactaa 
tttggttggg 
tgttccaact 
agggtcccaa 
cacccatggc 
ctctcccact 
cctccctctt 
cccaggccca 
tctttggatg 
tttaaccaca 
tctccaagac 
cagctctgtc 
gtgtttgtgc 
cacattgatc 



gctggttcct 
aatttcttct 
agcattcctg 
tcagctacag 
ttatccccct 
ctggctgcac 
gtgagctcca 
gcagactttg 
gcaaggaaga 
gtggtgcctg 
gtcacaggga 
gtttgtgtgg 
ataacttcct 
gcttctgctg 
gacgctgctc 
ggtgccaaaa 
accacRttgg 
agcttttcta 
tccctaaact 
agcttcttca 
gattaagaaa 
atcacagagc 
tttaaaataa 
tattctgtat 
tcgataaaga 
gaggggaaat 
agaggatcat 
tgggccctgg 
tttccctggt 
ctattacgta 
gaggggctgg 
ctctgtacat 
tgctctgagc 
cctgtactgc 
tgctgtctga 
gttgagcagc 
actggtccat 
atattgggag 
gatctaatgt 
ccccatgtgt 
ttcacacctg 
tctgtgttct 
taagtctttg 
caactggtgt 
taagctatca 
ccaatgccac 
tttgactgaa 
cttgacKgcc 
gccaaactaa 
atttcatgga 
tgctgaaaag 
gacttgtgag 
ctccaagctg 
atggcagttc 
ccactatgcc 
cactggtcct 
ctcaccctac 
cccctgcccc 
gcatcatgct 
Ygcccatctg 
ataaaacaac 
caactccatt 
agttcctcag 
aggtcaagat 



gcacacctcc 
tcctattccc 
ccttctcctt 
aaatgccaaa 
aataaaatgc 
tggtctgggg 
ttagattctc 
ccccctcatg 
tgcctgaagt 
caatcgttct 
gtcgggccct 
cttcgtaata 
taaatacctt 
atgatgtgta 
ccagccctac 
caagaccagc 
agatggtgca 
tggactcctg 
gctcatcttt 
gagcttcgta 
ggcagcttac 
tattactatg 
tttttccaag 
gggaataaaa 
tggctcctca 
tttgtagatg 
agaacagtga 
gaagggatct 
ggtttggcta 
tcttttctgc 
ggactccatg 
tgaggcatag 
cttttctcct 
cccagttcag 
gcattcttgc 
ttgttaggcc 
ctcccactcc 
ctgggctcag 
gattcacagg 
tttgtcccac 
aaaccctccc 
ttccctgtga 
tagtgaatta 
cagaaRgaag 
aataaaacct 
atgccctttg 
aagtgaaagg 
tcctgacctg 
tgccttagca 
acctcttatg 
acctgcattt 
tctcctgaga 
gcccgaagaa 
agaaYgcccc 
catcagtgat 
ctgcaagtcc 
tcccagcctt 
atcctgaccc 
acttggttcc 
attgaactac 
agccccagag 
ccagcctctc 
aggtggaaga 
ttttctggct 



cctctattcc 
cacactctcc 
ggtgcacttg 
gtcactaaca 
ctgcattttg 
tcagtgggta 
ctccagctcc 
caagtgcttt 
ggtcgcactc 
ctctgcttga 
ggctctggct 
attattaaga 
caaKcggtca 
tgagtgctcc 
cccagctgca 
aagggacatg 
gcctctgtgg 
gatcttcctc 
ctgctgcaca 
gagagcatct 
ttatctgtaa 
agtcaaatgg 
gttaaacatg 
aacttaaagt 
agcaagagag 
aacaaagaga 
aggattaact 
ctggggccct 
cagcacagta 
cttccaaagg 
tcagccatca 
gttgagaaag 
ctaggcgctg 
caagaatccc 
tgtctgatca 
agtttagcag 
tcaccctggt 
tctccctccc 
aatttttcct 
attcacacag 
agactctgcc 
taaaccacaa 
tcaaacctga 
ggtgccttca 
cagactggaa 
atcttctgac 
cctggctatt 
gcacctccaa 
ggtgacagct 
gactcatgag 
ggacaagatg 
acccttcaaa 
acacttcctc 
agttcccagg 
gggatcaaat 
atggccacgt 
ggtgcccgag 
ctggctctgc 
ccacctYtga 
tattagcatt 
ctgtgcaggg 
tgtgttctgK 
aaagggcccc 
tttttcaaag 
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31201 
31261 
31321 
31381 
31441 
31501 
31561 
31621 
31681 
31741 
31801 
31861 
31921 
31981 
32041 
32101 
32161 
32221 
32281 
32341 
32401 
32461 
32521 
32581 
32641 
32701 
32761 
32821 
32881 
32941 
33001 
33061 
33121 
33181 
33241 
33301 
33361 
33421 
33481 
33541 
33601 
33661 
33721 
33781 
33841 
33901 
33961 
34021 
34081 
34141 
34201 
34261 
34321 
34381 
34441 
34501 
34561 
34621 
34681 
34741 
34801 
34861 
34921 
34981 



aaagtcctgc 
ctgtaatccc 
ccatcctggc 
atgcggtggc 
gaaaccggga 
gacaagcaag 
aacacttttg 
cctaccatgt 
ataaagctca 
catcaccagt 
aggtcactca 
acattctcaa 
gcccagggga 
tacctgggca 
atttgcagat 
cgtttctttt 
aatggcattg 
tatttttgat 
tgagaccYta 
taaagagagt 
aaaaagaggc 
tgccttccag 
cacacacaca 
agccctgagg 
gcaccttcac 
ccacccagcc 
ccaYggtgct 
tccaccatgY 
tgcctattgt 
ggttatgaaa 
atatcagagt 
atataatata 
Mtagatgtgt 
gctgtatcac 
aaggtacaaa 
attatcagta 
aaatgtgtgg 
gtatggattt 
tgcgaggcct 
cacccaaccc 
tactttgaag 
atacctgtgg 
taaactacca 
gagggggttc 
ctgtgttacc 
ctgaacccag 
ttccttgagc 
gcttctaatc 
tgtccagggc 
aaaacaggcc 
ttttttcctc 
cactagatct 
atgaacacct 
ctgccaacta 
acaaaactta 
attttccaac 
ccttattcca 
catgaaatgt 
gtcttcaaaa 
gaatgaggcc 
agggcaggcg 
aggggatggg 
gtacccatgt 
gtgaggggag 



tgggcatcca 
agcaccttgg 
taacacgatg 
cggagcctgt 
ggcggaggtt 
actctgtctc 
gtcctggtct 
gtcagtcatt 
atggatgaag 
agatggcaga 
ctgccagcga 
gttcagcaga 
gatcaggatg 
attcagccgc 
gcactgggct 
ctacttggaa 
agaataaggt 
ggaggcatca 
aaccccagta 
cgtaagggta 
agaagatctc 
ccctctctcc 
cacacacaca 
cagMctctgt 
cttggaMttt 
tgtggcgttt 
ttgtggagga 
tttctcccca 
caaacgcttt 
atagttcatt 
gtattgaaca 
tatatacata 
ttaggatggc 
tgaccactcc 
tgtttcaaca 
caaagcatct 
tttgcaggga 
atgtatgcac 
cattctcatt 
agcaagcagg 
actccatata 
gattttaaac 
tgcacagagg 
tttaggcggg 
tctgtcctct 
ctggccgcgt 
ctcccagcca 
ttccctgccc 
ggctcctgag 
ttagggaaat 
cttcactgaa 
gtgtatttat 
gggcaaacaa 
tggggaactt 
acttcggcca 
aaggtaaacc 
cttccatatt 
ctttcatttt 
ttcgattgga 
ctgcctgagg 
ccctggggga 
aaccaactct 
actatttgat 
gtaagaagct 



actctaagaa 
gaggccgagg 
aaaccccgtc 
agtcccagct 
gcRgtgRgcc 
aaaaaaaaaa 
ctccccagca 
atgctaaata 
aaaggaaggc 
actgaccctg 
aggacttgga 

gggggctgcc 

gcatagccgg 
agtcacgaat 
gactctctcc 
acaaaaagca 
ggttcgaKgt 
gtgctataga 
actatatttg 
gggccctaat 
tctctctctc 
ctgtctctct 
cacatacacc 
aagccaggag 
ttgcctccag 
tactaaggca 
accRatctat 
tgtacgggaa 
acaccagtgt 
ttgtcaagag 
cagcaagagt 
agggtgtata 
agcattttca 
cacactggtc 
gagaaaccaa 
tagtacacag 
tttgggaatt 
tgggacatga 
tcacagaagg 
aagctcagga 
ttttgaaaat 
attttattga 
attcctcctg 
caggggagca 
gcaggctggt 
tattaaacca 
agcccttctg 
tctcctcatc 
cagatgtgac 
gaacacagtg 
aggatgaagg 
tgctggggtg 
aagcaaacag 
aactggagtc 
gtcatgagca 
aaaaaacata 
caccctataa 
gatgtttccc 
cctcagattt 
cagtcatccc 
caaaaagggg 
tagggaggcc 
cctcatgacc 
ccaggtgaca 



aaacacttgg 
cgggaggatc 
tgtactaaaa 
acttgggagg 
gagatcatgc 
aaaaaagaaa 
atctttttaa 
ctttacatgc 
agaaagtagt 
cccttgtggg 
ggaaatattc 
tggaggaggt 
ggcgccccat 
ctcttccagg 
cggaagtatt 
taagattgga 
taatcctcct 
ctgagttgtg 
gagattgagc 
cctacaggac 
ctcaccccac 
ttctctctct 
ccccacactg 
cagagccctc 
aactgtRaga 
tcctgagaag 
ttcagctgag 
atattccagt 
cttcaaaatc 
cagcatttta 
gagtattctt 
tatatatacc 
atctgtgtca 
aacgctacaa 
tgaatattca 
caggattact 
atggaactgc 
ggtaaaatcg 
agaaattgag 
tttgactcaa 
gcctgtggtt 
aaaagaatat 
ctgctcatgg 
agaccctcag 
ctcaggggac 
cagatcatct 
aaggattccg 
ttggacagga 
ttggttgcca 
ctgacttggg 
aaaccagggt 
ggagtaccag 
gccttcagaa 
gtttcagatg 
gccagctgac 
atttgacaac 
atacctgcct 
acttcatgac 
ttctttgaca 
tggtaagcag 
cttccacctg 
ctactgtgtg 
acatggggaa 
cagagaagac 



tcgggYgcag 
acgaggtcag 
atacaaaaaa 
ctggggcagg 
cactgcactc 
aaaaaagaaa 
aaggtaacat 
attagcttat 
caagtagctt 
cttcctctcc 
ctcctggact 
gtgcctgcca 
ggaggtaacc 
cttcattgtt 
tggtggcctc 
agaaagtgtg 
caaccagctg 
tgccctccaa 
ctttaaggaa 
tggtgtctag 
cccgccctcg 
ctctctctct 
ggaaggccat 
cccagacatc 
aaatacatca 
agtgattcac 
aatcaccaga 
gatcgcttcc 
tgagtttcga 
aaaagaaaag 
ttgtaagata 
cttatgtaaa 
atatgacaca 
atgtttcaac 
ccttaaatgg 
gcccagaKga 
agagaaggga 
tttcctgccg 
acccagacag 
ggccagcgtt 
cctgcattct 
ttcaacccta 
gcgctctcag 
cactgatccc 
atggctcccc 
tgtttcggag 
gggttggctg 
ctcagcagag 
aggcaactca 
tgaSgggtat 
tggatggagg 
ggtgttaaag 
Wggctgtaac 
ggtgcttacg 
agacgggtac 
tgcaacaaat 
ctgacacttt 
ttgcttcttc 
aggggtattt 
gtgcttgtga 
gaatcaacag 
gcaggcactg 
atattatact 
gtggcRaagg 



cggctcatgc 
gagatcaaga 
aaWtagccgg 
agaatggcgt 
cagcctgggg 
aaaaaagaaa 
taacggagca 
ttttctcaca 
gcaaagctcc 
cctcagcctg 
gctgagaggg 
gagaaaacta 
ccacggaggt 
agtcagcagg 
ttcagtaaag 
ctacagccta 
tcacatgggg 
attcatatac 
gtcattaatt 
ttaaaagaaa 
ctgtctctct 
ctcacacacW 
gtgaggacac 
aaccctgctg 
gttgttggag 
cagggaagtg 
aagtgagctt 
tYctgccatg 
cccatcagtg 
agcattgaaa 
tttgaaatat 
ttcctgggat 
gggtgcatat 
cttgtaatca 
gatacataat 
gttgccccaa 
ctgggaagca 
tggacacact 
cgaaagacac 
cccttgccat 
ccactgtagt 
caccattgaa 
atgggcagtg 
tgtggccctt 
actatcacct 
aSagagtttc 
gtaggtcccg 
cccagtttgc 
gtgaggggag 
tttgctttta 
gaacctccag 
caaacttaaa 
cccccttaaa 
ttaggcacaa 
acgactagga 
caaataatgt 
tccatcataa 
ctcaaataaa 
gagatcatgg 
tgggccgtgc 
tggcaggtaa 
gctgggccag 
cgcattccag 
ccagaatggg 
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35041 
35101 
35161 
35221 
35281 
35341 
35401 
35461 
35521 
35581 
35641 
35701 
35761 
35821 
35881 
35941 
36001 
36061 
36121 
36181 
36241 
36301 
36361 
36421 
36481 
36541 
36601 
36661 
36721 
36781 
36841 
36901 
36961 
37021 
37081 
37141 
37201 
37261 
37321 
37381 
37441 
37501 
37561 
37621 
37681 
37741 
37801 
37861 
37921 
37981 
38041 
38101 
38161 
38221 
38281 
38341 
38401 
38461 
38521 
38581 
38641 
38701 
38761 
38821 



gctcagttca 
ggtgcagggc 
ctgggagcag 
ccgcacctcc 
aggctgtggt 
ctgggccatg 
gggggactcc 
ccgggttcct 
aggaaccagc 
ggtcactgag 
ttttttctta 
ggacagaggg 
aagaccaacc 
aaattctact 
tcctgtaggg 
aaactacaga 
gcctggcaac 
ctgcagggct 
aaccgtggct 
ttcacaatag 
tccactccct 
cctcattcca 
ggtgctgtgc 
gatgcttctg 
cacattaaaa 
tccaactcct 
gcatcagcca 
acattctgct 
atgtttaggc 
gactacagtg 
aacatttgtg 
acagtcctct 
aaaactgaaa 
ctgggttccc 
tgccaatgaa 
ggagaccgag 
gaaaccctgt 
tggcgtgtgc 
aggaggcaga 
tgggactctg 
agagacttca 
aggggtaaag 
tgctaagcag 
acagtagccg 
gccccagaaa 
gggaagactg 
agcagaaaga 
aaatggatga 
acataaaaat 
ggaacaaaaa 
aagattcaaa 
aagaaaaaaa 
taattacata 
ttgggagggt 
tagtgacacc 
ctagtcccaa 
agcaagaccc 
agaatgagct 
acactcatgg 
aacccaggga 
ccagtccagg 
ttgcatataa 
gggggaaaaa 
tgtgcaacca 



gtgggacgct 
aagagctgga 
agcgggaggt 
tggaccctgc 
gccccacttg 
ctcccctgag 
atgtggcctc 
ctggggctca 
cagggaagag 
agactttccW 
acccttgagg 
aggttggagg 
tagcccggtg 
tgctccagct 
gtatgaagag 
gtctatacac 
atgtgtgata 
cctcatatgt 
cccattttga 
gggtagagcc 
taaatatata 
aatctgtgtc 
tggacaccct 
cttcaacttc 
aaaaattttt 
ggtttcaagc 
ctacaaccag 
tgtgatgatc 
attgttctaa 
ttagtcaagg 
gactcgactt 
tccaagagcc 
cagatgttca 
acccaaccag 
aatgtgaatg 
gcaggtggat 
ctctactaaa 
ctgtaatccc 
agttgcagtg 
tctcaaaaaa 
acagatattt 
ggatctgcaa 
agcctcagag 
tgagtctaca 
taatatcttt 
ccagtggacc 
caaaaaatgg 
ttgcgaaaat 
gttttattca 
agacttcttg 
aagataggca 
caaaaacaaa 
gaaagctact 
aagcaggaga 
ccatcattac 
ctactctgga 
tgtctctgtt 
gtggaggctg 
agaggggata 
gaaggaaatg 
atagaataag 
ttcacagttt 
acaaaggcaa 
tattgtacaa 



gagtcactct 
gggaggggac 
cagaggtcgg 
tctcccccta 
agtcgtttgc 
gagaggtcag 
ttttggggct 
ctgagttgtg 
gaagaagcaa 
tggagctctc 
aggagaaagc 
ggctggatct 
ttttttaatc 
ccctctgccc 
aagataatca 
cgaaatagag 
actaattgat 
gcctcattcg 
aacagaagaa 
aggacttctg 
aaaatattga 
ttgcactcta 
ttgtccccct 
ccaagtagtt 
tttagaaatg 
aatcctcctg 
cctaaataaa 
ttgccttggg 
gaaggtagaa 
cacttatagg 
tgcaccacta 
tattaagtgg 
gcgacttgcc 
acggactcaa 
gaccagttgt 
cacgaggtca 
aatgcacaca 
agttacccgg 
agccgagatt 
tatatacata 
ggaagacaga 
tccacagcaa 
aggggagaag 
ccaatggaaa 
gtatcctgat 
tgcccctttc 
ggaaaaggtc 
agaaaataat 
taaatgagaa 
agaattaaaa 
aggaagcttt 
aacagagcta 
tgagtctgag 
attctttgag 
aaagaataca 
agctgactgg 
aaaagaaaaa 
agaggaggaa 
aggaaaacca 
caaggatcac 
aaggaaaggg 
gagaaaaact 
ttagaaacat 
ctcagctcaa 



tttcacaggg 
tggcctgtgc 
gRctgctggt 
ctccccgacc 
taggtgccag 
gacatagatg 
cagggaagac 
gaaacgccac 
gcctacctga 
cagtcactga 
aaattgtggg 
ctgcaatccc 
tcctgggccc 
tcactctcac 
gaggaaggga 
atgggaagga 
aaYagccagt 
atgatcacac 
acctcaaatg 
attcaaatcc 
ctcatgcata 
tgcccaagtg 
tgcttctgcc 
gtccctacag 
tggatcttgc 
cctgagcccc 
tgctgatctt 
aaatgagaaa 
gaacatttat 
tataataaca 
attgctctaa 
gtaccatttg 
caatctcaga 
gaacacacac 
ggtggctcac 
ggagatcaag 
cacacacaca 
gaggctgagg 
gggtcactgc 
tgaaaatttg 
actggctgaa 
caccaagagc 
ttcaccagcc 
aaattagaga 
gccaaggagt 
tttaatatga 
aagataacaa 
ttttaaaaac 
cattatgtac 
ggtgattgag 
ccagaaaatc 
gatatagaat 
catggtggct 
tccaggagtt 
aaaatgaacc 
ggagaatctc 
caagtaaata 
gaatgtgagt 
agagagaaaa 
agctgggcat 
ctctgtaggg 
gaggaaatca 
caggaaaaac 
cagtaagcaa 



ccactcagag 
tggaaacacc 
caacctcctc 
ctgctgtttc 
ggcagggagg 
cccccacaga 
tcatcggcct 
cctccattct 
gtccatggtg 
cctgactgga 
actgaatgtg 
tttgcgtgtc 
cagccagctg 
accaaggcag 
tggacatctg 
acgttctgac 
gtgtactggg 
acaggccagg 
ttaagcaacc 
catgtcttcc 
catgcttaYt 
tctgtcttgg 
tgtatgtcct 
gtacatacca 
tttgttgccc 
ccaaagtcct 
tatacagaga 
aaagaaagca 
tgactatccc 
gtcaatgaga 
gatgttataa 
tatttccatt 
aatctactaa 
actgcctctg 
gccttaatcc 
accatgctgg 
cacaaattag 
caggagaatc 
actccaccct 
aatgaggaaa 
gaagtgacaa 
acattcagcc 
ccacgaccca 
cttcataact 
tctcatttca 
atagatctag 
gtaaaaaatg 
tcaaacccat 
tgaaaaggaa 
aaaataaaca 
aactaaaagg 
aagaaagtta 
catgcctata 
ctagaccagt 
aggtgtggtg 
ttgagcccaa 
acagaaaaga 
gtgtgtgtgt 
agacgtgaaa 
gcagcaggcc 
aaaggggact 
tgaaggcaaa 
taaagccgtg 
gatttaaaRc 



cagaggggct 
ccagggtcag 
tctcagccct 
cacttcccca 
aagtgggcac 
cccttgggtc 
gcttgaccat 
aggtgcgagt 
ccagcagcca 
agggttcgtt 
agccacctgt 
agcaaatgcc 
gtatttagag 
ggtccttttg 
ataatcacag 
aatgacctgg 
tgtctgtgat 
agatgggtgc 
ttgccatgtt 
atgactgggc 
gtaaatgact 
tgaattcact 
ggctgcaagt 
ccacgctagg 
agagctggtc 
gcgattgcag 
aacactcttc 
aatgttgatt 
ttcaagatct 
aataggggtg 
ataataactc 
ttcagataag 
ggtgtaaaga 
caccctctgc 
cagcaatttg 
ccaacatagt 
ctgggcgtgg 
gcttgaaccc 
ggtgacagac 
acaggagatc 
ctgaatagag 
aagaattgcc 
tacaggattc 
gcatctaaag 
tcactctaca 
gatcagataa 
agtccagaag 
gtcctcaggg 
caaagatgat 
actctaggac 
atgacagaaa 
ccatacataa 
accccatact 
atgggccaca 
atgcatgcct 
gaggtcacag 
aagttacttg 
gtgtgcacgc 
gagaaaaatc 
tagaaagcaa 
cgatagaaca 
cagtacaaga 
cagcaaaaca 
gtcaccaaaa 
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38881 
38941 
39001 
39061 
39121 
39181 
39241 
39301 
39361 
39421 
39481 
39541 
39601 
39661 
39721 
39781 
39841 
39901 
39961 
40021 
40081 
40141 
40201 
40261 
40321 
40381 
40441 
40501 
40561 
40621 
40681 
40741 
40801 
40861 
40921 
40981 
41041 
41101 
41161 
41221 
41281 
41341 
41401 
41461 
41521 
41581 
41641 
41701 
41761 
41821 
41881 
41941 
42001 
42061 
42121 
42181 
42241 
42301 
42361 
42421 
42481 
42541 
42601 
42661 



tgtaaacact 
taaatctact 
cactgaagga 
tactggctac 
aacaaccatc 
ttggagaagt 
aagtgcccaa 
tatgtaaaga 
tgtaatccca 
agcagcctga 
gcatggtggt 
gaacccagga 
aacaagagtg 
ctgcttttaa 
gtatacactt 
ttagtaactg 
gagaagtgtg 
cttcagagaa 
attttatgtt 
gtataatttc 
ccaagacgtc 
aaggacgctt 
accgctgtga 
caaggttgag 
ggtggttggg 
acttgagaaa 
ggctacaggt 
gagatcaata 
cctgcagatg 
tattagactg 
cccccatttc 
gaggaggaag 
tatattcaac 
aatctgtgaa 
acagaaaagg 
aattgttaat 
tttatttatg 
gaagctggcg 
cagaatagta 
agacttcata 
accaaacact 
taacctcttc 
ttgcccccca 
ccaatcaccc 
gtgccgatgg 
ccacctacag 
ggtttctgta 
ccctcaaagg 
tcggctgcat 
cccagacttt 
aaggtcatgt 
cctctgtctt 
gtgacagtga 
tgctgtaagg 
cagtgtgtga 
agccagcaca 
acacctctga 
gaatcttgtc 
aggacacagc 
ttagttcctg 
ctctccagat 
gcctcctcct 
gaatctttat 
agtggcacca 



tggttaactt 
tgttaccagt 
aggaggggga 
agttaagggc 
ctacgtaggc 
taagtaattc 
tctccagagc 
aaatgattta 
gcactttagg 
ccaacatgga 
gcatgcctgt 
ggtggagttt 
aaattctgtc 
ttgggaaggg 
attaattgat 
ctaacctcaa 
agcaagtgcc 
acctcggtgg 
aaactgttaa 
cagtggcacc 
ttgtggggaa 
ccaaagatgt 
acccggaaaa 
gacgtatgcc 
gtgtagcttg 
tacatcggtt 
aaattgaaac 
ggaaggaatg 
aagctttcag 
aaagcctgtg 
ccttcatggc 
tcctttaaat 
agtaataatt 
aggccttggg 
ctaaaacagg 
tggttttaac 
ttcctgttcg 
tggactccca 
accacacaca 
atactgtcta 
ccctctcctg 
ctggtctccc 
ctttctgcaa 
agccagaacc 
acctccacct 
gagtgatctg 
gtgaggaggc 
taggggttgg 
ctgtttcagc 
atgttaacaa 
gtagaagcat 
agggatcttg 
ggccaataga 
gataagggtg 
ccccctaggc 
gagcagagga 
tgagctacac 
tggggcatcc 
atgtcctcYc 
gaggaggccc 
gcagaggaca 
taggcagaga 
ttttattttc 
ccataactca 



ttaactttta 
cagtataatt 
ggattaatat 
ataagaaata 
actgttatca 
acctggagcc 
atgcactctc 
ctccctaaag 
aagctgaggt 
gcaactctgt 
tatctcagct 
gtggtgagcc 
tcagaaaaWa 
agtgagtgga 
ttttttaaat 
taatggtttg 
aaagagaaat 
ctggggtaac 
tattaacagc 
ttgccactca 
ccaaattccc 
tcagagcagt 
tctgagactg 
tgtgacacag 
attttataca 
tggtctagaa 
gttttctggt 
tttgagttgY 
ctagcaggct 
ttgatattaa 
ctgaaccagt 
gagggggagg 
gattgtgatt 
ttgatactaa 
gtacttgaaa 
aagaagagca 
taagtgttga 
gaagggctgt 
gggtgactct 
agtatagaca 
gctaaataga 
tctctctaga 
cacccaatcc 
cagatattct 
gtgtccccct 
gcagccccag 
aacaacaagc 
ctccaagcag 
tggctcccag 
cgcagtggtt 
cctcatcgaa 
catccaggtg 
ggcactaggg 
atagaaggag 
cagggcagct 
gcaggagggc 
ccagggtctc 
gctgagggtt 
tgggcaagcc 
agacagggag 
gagactctca 
gagagggaga 
aatatttttt 
ctgtagtgtt 



gaacaaaaca 
aatgttccat 
ggccattttg 
aagttaaata 
tcttaatagt 
cctagctagt 
actctatttc 
aaaagctggc 
gggtggatca 
ctctactaaa 
actcaggagg 
aagatagcta 
aaaaaaggaa 
gccctcttta 
tttaacaaaa 
gattgtggtg 
acacctgcct 
caagctgcta 
ataatcagag 
acatacgggc 
cagactgtga 
gttcaaaggg 
gtgtcagtta 
cctcaggaag 
cttagggagc 
aggcgggaca 
tgacaattgg 
gataagaggt 
tcacagagaa 
tgccagagag 
ctttcaggtt 
cttagaatct 
attatgatca 
cattcaagtg 
tgcaaagact 
tcttaatata 
gaatgaagga 
gttattctcc 
gagattgtga 
gaaactgagt 
atgcctgttt 
tgagatttgc 
agagtgggct 
aacaggttcc 
cactgagaag 
cctggaaaca 
ccaggcagct 
gcccctatct 
gcacatccat 
cccgggaccc 
agcctctcac 
tctattgtct 
gagtgtgctg 
ggctcccctt 
gtctcaacat 
agagagagct 
gtcctcctta 
gaagctggat 
tgcagtttgg 
ccctccctcc 
gcaaagcagc 
gaggggtgaa 
ttttttgctc 
gacgttctag 



gaacatttac 
ggtaaaggtg 
taaagtgaag 
aactcacgtt 
tctgagggga 
gacagccaga 
atctgctagc 
taggcacact 
cctgaggttg 
aaaatgaaaa 
ctgaggcagg 
cattgcactc 
aatctgaaca 
ttaaatgatt 
cagaagttac 
ccaccacctt 
cctgatggga 
agaaaagtac 
tgaactccta 
ttcttttcca 
tatgtgttac 
atgcccactg 
atttagaaag 
tcctgatgac 
atgagacatc 
actcaaagtg 
ttgagtttgt 
tgtggagacc 
acaggctgta 
gcataatgaa 
aaattttaag 
tacttttggt 
tgtttacttg 
agaacatttg 
tatcgtaaca 
aatggatatt 
agagctgcca 
tatggacaga 
gcaaatgaga 
ctctgctcca 
tttctacacc 
tgagataccc 
ccatttcata 
ttctcgcacc 
aatgaactca 
Ytgactctag 
gtgaggtggc 
ctttcttctt 
tccagccgct 
cctgatgcgt 
ctccctcttc 
ggggccccca 
aggggtttct 
ccccttcttc 
catgagccat 
ggggcctcag 
ggcgttgttg 
actgcccctc 
tcaattcccg 
tgcctcagct 
tccctgcaag 
gacaccgtgc 
tgttacccag 
gatcaatcga 



atatgagaac 
ggaggtgaaa 
agtcaagaaa 
tagttctcac 
aaactaaagc 
agccaaactc 
ataagaatat 
ggttcacatc 
ggagttcgag 
aattagccgg 
agaatcgctt 
cagcctgggc 
tcaaaattag 
tgtctgaacc 
tttatagtag 
aaagttgatg 
gaaatgtcag 
ggatgttgtc 
tcttgaatca 
cccaaaggaa 
cccagaaagc 
gtcagtccca 
tttattttgc 
atgtgcccaa 
aattaagtac 
ggcgtttcca 
ctaaagaccg 
aaagttttat 
aaatgtttct 
gcatgtctga 
agccctggct 
ttataccgct 
gtccaagatg 
agaaaagtat 
gagaatgttt 
tttccttcta 
cagacatttg 
aataatgtca 
cagagcaatt 
cccacaaaat 
aagtcaactc 
actggtagac 
ggccctcctg 
ccttatggag 
actggttcaa 
agacttccat 
cctgtcaggc 
tactgtggct 
ggtctttgct 
tttcatcctc 
ctatttcctg 
tgcaaccatg 
gggtgcttct 
ctgctttgtt 
aaggaccagg 
accacacctg 
gcctggctcg 
gagctgaact 
gaaagggact 
ctgagccaag 
tggctgtggg 
tcctcctgag 
gctggaacgc 
tcctcctgcc 
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42721 
42781 
42841 
42901 
42961 
43021 
43081 
43141 
43201 
43261 
43321 
43381 
43441 
43501 
43561 
43621 
43681 
43741 
43801 
43861 
43921 
43981 
44041 
44101 
44161 
44221 
44281 
44341 
44401 
44461 
44521 
44581 
44641 
44701 
44761 
44821 
44881 
44941 
45001 
45061 
45121 
45181 
45241 
45301 
45361 
45421 
45481 
45541 
45601 
45661 
45721 
45781 
45841 
45901 
45961 
46021 
46081 
46141 
46201 
46261 
46321 
46381 
46441 
46501 



tcatccttct 
tcatttttgc 
cgagggatcc 
aatacctagg 
ctgggcacac 
aggtacacac 
attcccagcc 
tccccaccat 
agcgtccccc 
tctgtgtggc 
tgcggtgttc 
cagcctggct 
ggacaacatg 
tatctagatg 
tacttttggt 
agtcttatta 
gctaacctct 
ggcctgcatc 
ctgccaggga 
tacacacaaa 
catttaaaag 
tctaagtgta 
gttgagttgg 
tgaatagagt 
ataaaatgaa 
atccacaaga 
ccaggtgcgg 
acctgaggtc 
aaatatcaaa 
tgagacagga 
actgcacttc 
ataaaattta 
atgaaaaggc 
tttttgaaac 
taaaggtggc 
gatcgtcagc 
ctcctaggtc 
ggtgccatgc 
agggagtagg 
tggctcatgc 
gagatcgaga 
ttagccgggc 
atcacttgaa 
ctgggcgaca 
aaagaaatga 
gaggtgagaa 
tgtgtgtgtg 
ttttatttgg 
tcatctgagc 
agaagcccat 
ctgcaatgtc 
gtcagcagcc 
aaccctgagg 
gggggaaggt 
ataatccaaa 
ctcccacagt 
ggatcatgtg 
ctttcccact 
aaccaggtga 
ggaccctcag 
gtaggtgctc 
ctctctctct 
tggagtgact 
tcatgcctca 



gagtagtggg 
agagatgggg 
tcctgcatta 
tctcctgagg 
gggctcctac 
tgacgccctg 
tcaggccttt 
gtgcaagtca 
agccctcccc 
tctttgggcc 
ccagctgcat 
gaaatccgag 
gcccagcttc 
ctcatctctt 
ggctctatca 
cttttgtacg 
gaatcatagc 
tgggcccagc 
cttttacggg 
aaagtcttgt 
aagaagaaac 
ccccagaaag 
gaagaccttt 
tttacctctt 
agaataggaa 
aatgctcaaa 
tggctcatac 
aggagttcga 
atttgccagg 
gaattgcttg 
agcctgggca 
aaaatgggta 
actcaagctc 
attagataag 
acaagcccct 
gatactcagc 
tcagcctcca 
agcgttgttt 
cgggtcccta 
ctgtaatccc 
acatcctggc 
gtggtggcac 
cccaggaggc 
gagcaagact 
ggaggaaaag 
acacagggct 
tgtgtgtgtg 
ttgccagcat 
tctgaggagc 
gcaggcctcc 
aactttcccc 
tgctgggggt 
ctcagagagg 
caacacctga 
atcttccagg 
tatgctcacg 
gcagacacca 
agggaacagc 
gtgtcaggga 
cgagtcactc 
agagtcaggg 
cttttttttt 
ggtgcgatct 
gcctcccaag 



aactacaagt 
tYgtgctatg 
gcctYcgtgc 
aatctttaat 
tttcctcact 
cttccctgcc 
gctcaggctg 
cccatcctca 
aggctccctg 
tctgtctccc 
cccctatgac 
gtgtgggagg 
agtttcactt 
tttggggaag 
ggtttctgag 
aggRgaaaga 
catggtttcc 
tccctccggg 
aaagaggtca 
tcttctttgt 
aaattcttct 
aaaaaattaa 
cgaatcaaaa 
acaaattaaa 
gcacggtttg 
tcaacatgaa 
ctgtaatccc 
gaccagcctg 
tgtggaggca 
aaccccggag 
acaaagccag 
aaagatagga 
acttttgaac 
ccaaacctaa 
tcactgttga 
acactttaca 
gggagcagag 
ctatcaccaa 
acactatgcc 
agcactttgg 
taacatggtg 
gcctctgtag 
ggagggtgca 
ccgtcagaaa 
aagctgcaca 
ccatgaagtg 
tgaatttgtg 
gtcaaatgta 
caggcccacc 
agctctcccc 
tgactcgcac 
cagcagggta 
ggaagtgact 
cctcctgact 
tttgtttctg 
cttccacagt 
ttctgcaaca 
acagcttctt 
catggagtcc 
atctcccatg 
tgggtgaaac 
tttgagacag 
cgactcactg 
tagctgggat 



gcatcactat 
ctgcccaggc 
attagcacgg 
gacctgccag 
gctccctgcc 
tcccagagcc 
ctccctcccc 
gaggctcagc 
agcactctgc 
cgttaaactc 
tgacctgggt 
cagcacttgg 
tcagttgagg 
aacaaataag 
tttccattta 
aaaaagaaat 
aagcctcccc 
tccatgacac 
tttccttcct 
ctgtagtagg 
ccaaaaataa 
aactatacta 
ctcaattccc 
aattagaaag 
caatacttaa 
aagactgaaa 
agcactttgg 
accaatatgg 
cgcgcctgta 
gtggaggttg 
actccatctc 
tctgaagaaa 
catatatgta 
agtgactaat 
tggagtgtga 
aagtgccccc 
ggcaagtgtg 
aaagggagaa 
cattaagact 
gtggcggagg 
aaaccccgtc 
tcccagctac 
gtgagccgat 
aaaaagacta 
gaatcaaaga 
aaatctgaat 
agtgtgtgtg 
attgcacctg 
ctctcacctg 
actttcctgc 
tgccccgcct 
gggcatccta 
tccccagtgt 
ctgacattga 
cttctacaaa 
ttgactttta 
tgccgttctc 
tcttctctat 
tgtgatctac 
gcctcagttt 
aacctctggg 
agtctcactc 
caaccttcat 
tactggcttg 



gtcagtcaat 
ttgtcctgaa 
gaattacagc 
ggtcaagttc 
cacacagtca 
catctactcc 
ttggaatgcc 
caaagctgcc 
tttccggggg 
tgagcatccg 
ggcctctgcc 
actaaaagca 
gattgaatgt 
accatgactc 
ttttctcccg 
ttctaaaaac 
tcccccacgg 
tcttatgcag 
gcacttgttg 
tacccatgaa 
gtaaatcaat 
gaagaaatgt 
agaagactta 
aacatcattc 
agcaaaggta 
tcaagtttct 
caggccaagg 
tgaaactctg 
gtcctagcta 
cagtgagctg 
ataaataaat 
tagaaattaa 
agtgatcaca 
aatgtccact 
aaggttctgg 
acgtcatggc 
cagggacagg 
atcactaaat 
aaggaatgcg 
tgggtggatc 
tctactaaaa 
tcaggagggt 
ggtgctactt 
atgaatgcga 
cccttcgagg 
agcaggtgtg 
tggcatatgc 
agtttatagc 
ctgggcatca 
agcccctgca 
cccacctcac 
accccctMWg 
cacacacctc 
agcaagaaag 
cacagagggc 
atctcatgat 
actgcccctg 
cacccagagg 
ctgctcaact 
acccacaggg 
ttccttgtta 
tgtcaggagt 
cacctgggtt 
tgccaccacg 



ctttttattt 
ctcctgggtt 
cctgagccac 
aaccccttaa 
ccttgggtcc 
ccccactccc 
tcccccacct 
tggaccctgc 
cagctctacc 
gcagggaccc 
ctccgtggca 
accagcctct 
ttttccttgc 
tgatttcatg 
tgtcctcccc 
acgcgagcca 
caatggccca 
caaccaacta 
tgccttagaa 
ataggtccta 
cccaaaggga 
taaaaacaat 
aagggaaaat 
aagagccaga 
actgtccaca 
aaaaatatag 
cgggtggatc 
actctactaa 
cttgggaggc 
agattgcgcc 
aaataaataa 
cgttaaccat 
gagacatacg 
gttagggagg 
tgtcctggag 
acagcaccca 
ggctttagag 
ctcctgccat 
Yctggcacgg 
acaaggtcag 
atactaaaaa 
gagctggaga 
gcactccagc 
tgcctccctt 
ccacgacaca 
agtgtgcgtg 
aaggtttgcc 
aaatgttccg 
cctggatctg 
ggcatctggg 
cccagatggg 
gcaggtgaaa 
ttggagggca 
tataatgtta 
taagctcacg 
tgtcaagctg 
tctgggccat 
cctgaaaagg 
tcctgaYatg 
cctggctcaa 
tctcttttcg 
gcactcaggc 
caagtgattc 
cccagctaat 
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46561 
46621 
46681 
46741 
46801 
46861 
46921 
46981 
47041 
47101 
47161 
47221 
47281 
47341 
47401 
47461 
47521 
47581 
47641 
47701 
47761 
47821 
47881 
47941 
48001 
48061 
48121 
48181 
48241 
48301 
48361 
48421 
48481 
48541 
48601 
48661 
48721 
48781 
48841 
48901 
48961 
49021 
49081 
49141 
49201 
49261 
49321 
49381 
49441 
49501 
49561 
49621 
49681 
49741 
49801 
49861 
49921 
49981 
50041 
50101 
50161 
50221 
50281 
50341 



tttttgtatt 
taacctcaag 
accgcgccca 
gcagcagcac 
ctggatcacc 
agtcccaaca 
agagctgagc 
tgtttggggg 
gctctgccat 
caaaaggtga 
cagaccaccc 
gttctgagct 
tccagKctct 
cagctcctga 
atccttggtc 
cgtccttctt 
caggaagaaa 
ataacaccac 
tccatcacac 
tttgttgttt 
gtgcggtggc 
tgagtagctg 
gaaggggtga 
ctcctgcctc 
ggcttctgga 
gaaaagtcac 
tagcatcctc 
gctgttgtaa 
taaaagacaa 
gtgaacaagg 
gaaaaaggtg 
cttgcaagac 
agcccagcac 
tttacccaca 
aaatgctgag 
acattagcag 
ccacctaagt 
tagtttgtgg 
aaggggcaca 
catcattata 
aaattatggg 
agatgacatc 
cagaagcacc 
gtccctctga 
gttagcctgc 
gctctgttaa 
agtgctgtga 
ttggggaact 
tgctatgttt 
agagggggta 
ttctctggat 
gtggccattg 
ttcttatgct 
tggctggatg 
gatagcttgg 
agtgggtcat 
accccacccg 
tcgttttgca 
ctaaccactt 
tgtaggaagg 
gacacacaga 
cctggtaatg 
ctggattgcc 
gatgtctcca 



tttagtagag 
tgatctaccc 
gccaggttcc 
cctggggcca 
tcctgtcctc 
tctgccttcc 
cctaggaagc 
cccagagaga 
ggaccgagat 
atgacccttc 
ctaacttacc 
gtgtggatcc 
gagatatacc 
tcaaacatgc 
cagcagcacc 
ggccaaccca 
gacctaccac 
actccccaga 
atgccgggtg 
tgttttgttt 
acaataatag 
ggactacaag 
aatcttacta 
agcatgagcc 
caYttcctta 
gctcctctat 
ttcagcagag 
ttggccacag 
agacaaaaga 
gctctgagct 
gggggNtgat 
tgcatccttc 
cagggagtga 
tcccgcattc 
ttggtcacaa 
ctgggaggag 
tcagccccag 
ccagcacatc 
tgcccagtaa 
ataaaatgtg 
taaaaacatg 
gtcctttact 
ttaaactcta 
gagtgtctaa 
aattgtttgc 
tctcctcagt 
cctggatgat 
gaaacatatt 
cgctgcttgg 
tatcccccaa 
ttaggtttaa 
aagcccagag 
ctggccacag 
caatttttaa 
ataaatgcta 
aacctttgga 
aggccccctc 
attttttttc 
ttattcagaa 
ggtctccatg 
ctgccccagg 
gtcaaggata 
agctgcatcc 
cagacaaata 



gcagggtttc 
gtctcggcct 
ttgtatctca 
gcctcagggt 
caccttctct 
ttctggggtc 
tttgtgaacc 
gggagggaga 
gtgacctcag 
tcttatagag 
aaggaagctt 
cacctccagc 
ctggaaccca 
aaaacaggat 
cttggtYcca 
cctgcctccc 
tgaaagacta 
tgcccacctc 
gacgcttatt 
tgtttttgag 
ctcactgtag 
catgagccac 
tttttaccag 
acYgttccag 
tgtgggttct 
cagaatgcaa 
cccactgccc 
tcccaggttg 
gtatattagg 
tttcaagctc 
tgtcaggtaa 
aaataatagg 
tggccctcag 
atgataaaca 
tccctttggc 
aaaagagagc 
agccattcta 
ctgaagaaag 
catgaaactg 
gttttattat 
cattgcctaa 
cagacacagc 
tatagagggg 
ctgtgcttca 
tcactctcag 
taaaggatcc 
ttacgttcag 
ttgctaaaat 
ctaagagaag 
catgccacca 
ttagctgttt 
cttgttaaca 
agccaggaat 
aattactgat 
agtttagagg 
gaactcccca 
cctgccacac 
ctttttcttt 
tctaggggac 
ctgggcacag 
ggaggcggag 
ttgggatgtc 
acactcctat 
cacttgtgag 



tccatattgg 
ctcaaagtgc 
aaggaaatcc 
ggccaggcca 
ggttcctcca 
accgggccat 
cagctgagct 
agtccagcta 
tcagttacct 
ctgctgtgag 
ctcttgaaag 
cgtgcatctg 
aaagggaaag 
gcttcccagc 
ccctcctgct 
atacttaggg 
tgagaccctc 
actccctcca 
gcatattgat 
atggggtctg 
cctcaaattc 
tcggcctggc 
gttggtttca 
acctgctctt 
tctgccagga 
atctagcatt 
catgttccac 
actgaacaaa 
aagaaggggt 
tctgagttta 
ttgtcagtca 
tgctagatat 
gaaaccttct 
gtttgctgtt 
cttttcagct 
agaaagcctg 
agacccccct 
agaaactagg 
tgccctcagg 
acaaaagccc 
ttttagttat 
cccaacctca 
ctgacttcac 
atacactttg 
aagaactgag 
atcccagcgc 
catggctttt 
gcttcctttc 
ttgcaatgag 
ataactttta 
aacgtgatta 
gcccagggaa 
caccctcaac 
taagagagcc 
tttacagccg 
aatcgtcctY 
tgccataggc 
tgaacagagc 
aaatgttttc 
ccatggtcca 
cctccatcta 
gagactacct 
gtaaaaccac 
actttagtgg 



ccaggcaggt 
tgggattaca 
tctgtcccca 
caagaaggac 
aatattctaa 
gtctcacgca 
gattctgcct 
aaacctgggt 
aacctctctg 
gatggggaga 
agtgtgagca 
cataataacc 
tgaaagtcac 
cccacccttg 
gatccaagag 
ttgggggttt 
cagattactc 
tgccatcagc 
tgtattgctg 
gctctgtcac 
cagggttcaa 
tacttttaaa 
aactctggcc 
gtcgaacgcc 
gcacctgcta 
tcctcccctg 
tcctgtggca 
gtggggggYg 
cNgggggttc 
ttaggtaaaa 
gccRtttggt 
cccaacagat 
ggcggcgggt 
tgatcatata 
cccaacaggc 
tcactgcaac 
aactgatgaa 
gcacaggtgg 
tcaccccaag 
cagagtgggc 
ataaccataa 
acttctctcc 
ttcacagaaa 
ctttgagctt' 
attactggtc 
agaattcctg 
aatatgtgct 
gctttgggtc 
tgttgtattt 
gagaagggct 
aacaacttca 
gctttgagag 
tccctttaga 
gattgggatt 
ggaacgaagg 
gcccaccctt 
agcagcacag 
ttcatttctt 
agcaaggctg 
cacataggta 
aagagcaggg 
tgatctcagt 
agagggctgc 
acctcaagcc 



ctcgaactcc 
ggcatgagcc 
tcgggcctca 
aggccaggct 
aggaatggac 
ggctggaact 
ctgtctctac 
gcaaattcag 
agctcatcta 
tggggaaggt 
agatccagct 
agacacgttc 
aacttggcag 
gcccagtccc 
cagccccaga 
gctgtgtctt 
cccaccccca 
attaccacag 
ttggggttat 
ccaggctgga 
atgaacctcc 
aaatgatttt 
tcaaatgatc 
atacctgtgt 
ccttcacctg 
agaagccccc 
cttttgttag 
gatgtgggag 
cttgcttcta 
gagatgacga 
tcacagcagg 
aacttcaagg 
gcagtgtgag 
gcctccagtg 
agttaagtag 
agctcctggc 
gttggtggta 
aaattcccta 
ttcattatac 
ttttctgtga 
actgccaatc 
acaaaccgca 
ttagcccact 
gcattttgga 
cagaactcca 
gtaatatctt 
tgattttagc 
aagatgtttg 
gctgcattag 
tagagaaatc 
ttaggttgta 
gcaaaatggt 
ctgggcattt 
cagatcatgg 
gtaggcccat 
ttgggcactc 
cttctgaaga 
ttctttcttt 
tagtacccac 
caacttgaga 
gacactctct 
gctgtgtatt 
ccaaactgat 
tctctttaaa 
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50401 
50461 
50521 
50581 
50641 
50701 
50761 
50821 
50881 
50941 
51001 
51061 
51121 
51181 
51241 
51301 
51361 
51421 
51481 
51541 
51601 
51661 
51721 
51781 
51841 
51901 
51961 
52021 
52081 
52141 
52201 
52261 
52321 
52381 
52441 
52501 
52561 
52621 
52681 
52741 
52801 
52861 
52921 
52981 
53041 
53101 
53161 
53221 
53281 
53341 
53401 
53461 
53521 
53581 
53641 
53701 
53761 
53821 
53881 
53941 
54001 
54061 
54121 
54181 



agactgccaa 
gctttaagtt 
aatgggtgaa 
tgctttttaa 
atatttagct 
cacaacatca 
ccatttccat 
ttaaccgacc 
ttaaaagtct 
actatctgac 
attttttcct 
atgtgtcagc 
tttgtgtttt 
tattctggat 
ttctggtttc 
atttcacaat 
tttcattctg 
ttctgagggt 
tgatatttgt 
tgtctgctaa 
cgatagcatt 
agcactgact 
actccatttt 
ctaaccaggt 
attgatgagc 
aacattgctg 
acagggccaa 
acaactggtc 
acttatagct 
agaatccata 
gatatctacg 
agagagcaaa 
attaattgtc 
ggtagttaaa 
agtgaaacag 
tggatggagt 
acaggtggaa 
accccaagtt 
cccagagtgg 
tataaccata 
tgactcctcc 
acttcgcaga 
tgccttgagc 
agattgctgg 
gtgcaggatt 
cccgtggcta 
agaagggttt 
catcctgggc 
cagggaggaa 
cttctctatt 
gtttacaggt 
ggaggagcca 
tttattttgt 
ggatctgtgg 
aagagtggga 
gtcacattct 
catggataga 
gaagataaaa 
aggcctaggc 
aaaacctcgg 
tttcagctac 
agtgagccgg 
aaacaaaaca 
atttgtgttg 



aaaataagta 
ggattgttat 
ataatttttt 
actagattaa 
accatcaaag 
cagaaactgt 
tcctgcttct 
agaataagct 
ataccaacaa 
taagcactcc 
aatatttcca 
ttagtcactg 
attcatttcc 
ctaccctttt 
agtctcacaa 
attggttggg 
tcaagtactg 
attaaagcag 
cttaacatga 
ttaaacgggg 
cccctcccta 
ataattggcg 
tactgcccag 
caataccctt 
ttttaatttg 
ttttcatcaa 
caagttaatg 
ctggttttcc 
ttcaaacgat 
aaactccaaa 
tgggcccctt 
aaagagaacc 
accgatctac 
tagacattag 
cccctggccc 
ttttggtaaa 
gtctcctaaa 
cattatacca 
gcttttctKt 
aactgccaat 
ccacaaaccc 
aatcattccg 
ttgcattttg 
tccagagctc 
attggtaaca 
agaaggcgtg 
tgagaggggc 
aaatgtccct 
caggagggtg 
taattggaat 
gagRaaactg 
agatttgtaa 
catggttgag 
cccacaagtt 
aggaggagga 
tgtgaggctt 
agaacagtaa 
ataaacaggc 
acttggatca 
gtctactaaa 
tcaggaggct 
gatcatgcca 
aaacaacaac 
aagtgagcag 



atgcccactg 
agttattctc 
atgctaaaat 
cccttgcaat 
aagttttaaa 
tgaatctgtg 
gagtggaatc 
gtcaatccta 
acctgttaca 
tttgtcttct 
aggcttgcaa 
acaacaaaaa 
caagcccccc 
ccccagaccc 
cttcgggcaa 
atcccatcac 
cctgtgttcc 
acttacacag 
caaatcacat 
atttggccct 
tacacgctca 
ttctggctgg 
aaactcttct 
tcagaacaaa 
ctcacagcca 
ccaatcaggt 
gaaaacagag 
tggttagccc 
gctcatttga 
tgattctaca 
ggatggccac 
caattattac 
ccaaaaattt 
cagctaggag 
acctaagttt 
gtctgtggcc 
ggggcacatg 
tcattataat 
ataaatgatg 
caaatgacat 
cataaaagca 
ctctccctct 
gtgttagtct 
cagctctgtt 
cttagaggga 
ctgtcggatt 
tgaaccctga 
gcagcctcgt 
attacacaag 
ttgataacca 
aggcaaggca 
accaagWtgc 
gatgcccaca 
ttcaaaagag 
gggatgaaaa 
tgattagcac 
ttatgcattc 
agggcttggt 
cctggagtca 
aatacaaaaa 
gaggcaggag 
ctgcaatcca 
aataaaatga 
cagggtaatt 



actttcccta 
atatatttgc 
agctgtttta 
catttctcct 
accttgttta 
tttacacctt 
aatcagaaac 
gggtcgagcc 
acctccgagg 
atcttcattg 
tcacacttgt 
atgcaaacat 
aacagaacca 
acaccagaat 
aatcattgca 
tccttcctct 
cccattacat 
catggaaaaa 
ttttgtatag 
agtttacatc 
atctggtaaa 
tattggcacg 
taaaaattac 
ttgcttcatt 
atcaaggggg 
aaagtgcaaa 
tttcaggagg 
cctttcttgg 
tttacaaggt 
caaggttacc 
cacttttctc 
cactgcctct 
ggcttttttt 
tgggtaagaa 
agccccagaa 
agcacatccc 
cccagtaaca 
aaaatttaca 
ggtaaaacca 
catcctgtta 
ccttaaaccc 
gagagtRtat 
gcaattcttc 
aatccccacc 
aggggcctaa 
gctggcggac 
ccaacatgac 
ggccaccacc 
aactgtacga 
tctcttaaat 
tgaggctgtg 
gtctgagaca 
ggaaaaaaaa 
ggttttaagg 
aaaaagacag 
ttcctgaatc 
atctcatgct 
ggctcatgcc 
ggaggtcgtg 
gtagctgggc 
aatcgcttga 
gcctgggcta 
acatagatta 
tctagtcttg 



gtgccttttt 
agttcaattg 
aatgcagtaa 
aatgtttttt 
aacacaacat 
ccctagcaga 
atttccagcc 
cacagtcaat 
acctgtcgtc 
aggatgaggc 
accctatttt 
acctgatatt 
ctatgtggaa 
atcattttaa 
aaaaactgca 
aattgggctt 
gaactagcct 
ccctacttta 
aaacaaggca 
atccccaaac 
actgaaagag 
ggaactggag 
tcaacaaatt 
agcaagcata 
agcttgtgca 
ccaacttaaa 
ctttgactgg 
tcctaaaaca 
tgttttgttt 
gcgaggtctc 
tgcccaacct 
tgtcagcagg 
ctctaaaggg 
aagagagtag 
ccgtcctaat 
aaagaaagag 
caaaaacatg 
tgtggttgta 
tgccagttta 
ctcagacaca 
tgtaaagagg 
tactgtgctt 
gctcactatc 
tctgttaaga 
gcccactgct 
ttttctgctt 
cctttcactg 
ccatacacct 
tacactttga 
gggccaggag 
gctttcaacc 
gatctcaatc 
aagagacaca 
gcttcaatat 
gacaggtagt 
tatgtgctgc 
cagtaaatgt 
tgtaatccca 
accaacctgg 
atggtggtgc 
atccaggagg 
gagagggaga 
gaggaagcag 
gttttgtcat 



ttaaatatta 
cctcctcctg 
ccaatatttt 
tccaacaccc 
caacggactg 
ggtctatcag 
cccacttctg 
tgcatactcc 
tatctgtaat 
cattacttgg 
ccacccaatt 
ctaattcttc 
agaagccagc 
acataacaac 
gctccccata 
ttgattcctc 
tgacttccac 
gtatcactgt 
tatctctgct 
tcactcttat 
gagttttaac 
gacctcctta 
attaatctaa 
gtcttatgaa 
acacttgata 
aaaattacag 
tggaaattta 
gctttataca 
cttcccacct 
aaccatgtct 
accagctaca 
aagtagccag 
ggaaatatta 
aaaggctgac 
gccatcctaa 
aaagtaaggc 
ccctcaggtc 
ccctcctttc 
attttagtta 
acccaaacct 
ggcttatttc 
caatacactt 
agaggaactg 
atccatccca 
cagccacagt 
taccccaggc 
ggaactgttt 
gccagatttt 
aatataccca 
tatgatctga 
agcaactcgg 
cgtttagtgg 
agtcacagtc 
ttcaaaggta 
gagaagagtt 
atgtaaaagg 
gtattttaca 
gcactttggg 
ccaacatggc 
acgcctgtag 
tggaggttgc 
cKccatctca 
tcacatatgc 
gtacctgtca 
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54241 aggtaagtta ctaatttgca ttgtcatggt aaaattccac aggctctgtt ttaagggaaa 

54301 agatgttggg gctcacaagg aattttcttg tgggaaatcc atgaggaaga ccacctgggg 

54361 aatatgttgc cttctgtctt ggcagatacc tgtttaggga caaaaagaag ggcagttttt 

54421 tgactgattc aatttccctg cttaacggtt ttctttggca tagagagttt ggggccctga 

54481 tattttattt tcctttgaca ttatctctcc cttttacaaa atctttttga gaaagcattg 

54541 gagatgcaaa tgagtctctg gtcatgagtt tcatctgatt cctgtcagat gaagttgtct 

54601 agtttcagtc tgtagggtgt tcagggaagc acggttttaa tttctggtga ttccgagtga 

54661 gaaaaatgag agaaagaaga caagaatgat gcttcttttg agacttgcag ccaagaaaga 

54721 attcacgagc cagcctagat gaattttgga caaataataa aactggaata caatggacaa 

54781 agctagaatc taataacagg tattctacag tttcttttaa aacatatttt tttttctctt 

54841 cagtcctcta attctaccaa agacaaatca tggtaggact aatttatttg aaaaataagt 

54901 tgtagtctta tgatacttgg cctgattact ttcataaagt gcagtaagaa caatgattta 

54961 ccatatgggt tcttcttaag aaattggatt tgctggaaca tcttcttttt cataaggaat 

55021 ctcagattac accttttaaa gccttgagtt cagccaagta tttatctgtg cctgcaggtt 

55081 tctgtatgaa ttgggtgagt tcctctcttc ttgagattcc aagataactt gggggttcct 

55141 aggcctgtca gaaagtatta ttttccttac ttaccccaca tcccacatca ggaaccctgc 

55201 atagttaaag aatgaggcca attttttgat ggggctttca ttggctctat aagtcaacct 

55261 ctattcctca aagctgcctg ctcatatgtg aaaatatgcc atgccagtca aagccttgat 

55321 aaaataacta gtgtctccaa tggtgtccta ttgcaaagga aaacagattg ttactgaact 

55381 tatgcaaata attctattgc cataaattaa gaaaactcac aaatagtttc caaattctgg 

55441 agaaatcaaa tagagagaaa gaaatgtgct ttaacttttg ctcacaagag tRtatttcat 

55501 tcaactgtca aaagctgtac atagctcaca agatatcaac gtttccttga ctctgaagaa 

55561 gaaaaaaaaa agaatcagcc atgtttcaaa cacaaagtca tctaaaaaat tatttcagtc 

55621 ctctattaga tcagtcctaa gatagaaaaa aaatcaaaaa tgttattttg gagacttgcc 

55681 gccaggaaag atttcaggat ccattctaga taaattttgg acaaataaca aaactggaaa 

55741 aacaatgcac aaggccctaa tctagtaaca ggtgttctat aactcgttcc acctgatatt 

55801 gggttagcaa tcctcatgaa cataacaact ttttaattag agtcctggat tttttttctc 

55861 tagtccaatg gcatagcctg tagctgacca taaactggta ttgagaagaa ttaaagtaaa 

55921 acagtaactg tggatagcaa aagtattaga gcagccttgg caaacacaaa attgacaagg 

55981 aaatttggtt atttctgtga cacacaacaa cttaccgtaa aaatgattat tattactaat 

56041 aacatatatt gctgtgtgcg gtggctcaaa cttgtaatcc cagcactttg cgaggctgag 

56101 gtgggcggat cacttgaggc caacagtttg aaaccagctg accaacatgg caaaatccca 

56161 tctctactaa aaatacaaaa agcagcttgg tgtggtggtg cacacctgta gtcccagcta 

56221 ctcgggaggc tgaggcatga gaattgcttg aacctaaaag gtggcggttg cagtgagcca 

56281 agattgtgcc attgcactcc agcctgggtg actgagtgag actctgtctc aaaacaaaca 

56341 aacaaacaaa acaaaaacac catagaggaa gctataacag aattataggc atctcataca 

56401 attttggaac acatttattt atatataact caaagaaagt tacaacacca ttttacatct 

56461 gtcctgctag acccgtgcta gacacttgca agctcggaga gcacagtctt ggagagagta 

56521 aagaaaagac ttggagagag cagatgagac ataagcttta ttcagggaac ttacatacaa 

56581 ggagtccagg agcagcaggc tggacagaaa gcccctcaca tttgtaaaaa gcatgcagtt 

56641 tatatagtat ccttcccaca gcaacctcca cttagccacc tccacccggc aacctccact 

56701 tagcccaaaa caaagggcct tggttccctc caggacctgc ttcccaagga ctgggccagg 

56761 ttacaggtgt tcttcatagg tatagtgtaa acatctggat tggccattcc tggggtccat 

56821 agtttagaac aatgaacaaa cattcaccaa ggaacatagg gccattctca ggggatgctt 

56881 gaattattgc tgtctgacag gacatgccca ctaggcactc cactccagta tagccttcag 

56941 tggccctctc gtaccaagta agatgaatat gtctcttcaa gacttcaggg gacctaatat 

57001 ctaaaaaatt aatgaggtca aaacactgga tttgaaattt gattttagaa agtttgtcaa 

57061 ataccaaagg ttgaaaacac ttgatatcac aaaataggat tacgctcact ataaagtaag 

57121 tcattcgtgt ggccaaagag ataacttaaa tatttcaaaa acatagaaat gtttacacct 

57181 tgagagagag aagactcagt ttcccacaca ataacacctc ataaagacag catgggccgg 

57241 gcatggcagc tcatgactgt aatcccagca ctatgggagg ccaaggcaag cggatcacct 

57301 gaggtcagga gttcgagact ggcctggcca acatggcgaa accccgtctc tacttaaaat 

57361 acaaaaatta gccagacgcg gtggtgggca cctgtaatcc cagctattcg ggaggctgag 

57421 gcagggagaa ttgctggaac ccgggaggca gaggtcgcag tgagccgaga tcacgccatt 

57481 gcgctccagc ctgagcaaca gagtgagact ccaccgcaaa aaaataaata aacagcatga 

57541 ggccagtgga atctgtttct ctccctacca gcttttttgt agtttattca aaaagcaaac 

57601 agacgtcttt tattatcttg taatattaca tgaaaatctt gttcaaagca gcaagcccaa 

57661 atttaccttt gcttaacact aaagttaatt ttaatatctt agaaacaaat ctgtcaaatt 

57721 ttaattagtt acatcttaag gtaaaatatt ttataaatat tttataacct tttcaatttc 

57781 tattaaaaag cagatcaatg ctccaagaaa gccctgttat tctcatacag tggctcagat 

57841 tctgaccttt catcagtgtg gtatttatat taattttcaa tttataggaa aagtaaataa 

57901 tccccattac aattttgcca ccttgatcac aaacaaattc tttttacaag aataatcttc 

57961 cacacacctt ctataattgg ctgaaaactt tggtttggtc ctgtttttca tttaacttga 

58021 aataatcctt aaaaacttct aaactagata acattattat tattattatt attatttttt 
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58081 
58141 
58201 
58261 
58321 
58381 
58441 
58501 
58561 
58621 
58681 
58741 
58801 
58861 
58921 
58981 
59041 
59101 
59161 
59221 
59281 
59341 
59401 
59461 
59521 
59581 
59641 
59701 
59761 
59821 
59881 
59941 
60001 
60061 
60121 
60181 
60241 
60301 
60361 
60421 
60481 
60541 
60601 
60661 
60721 
60781 
60841 
60901 
60961 
61021 
61081 
61141 
61201 
61261 
61321 
61381 
61441 
61501 
61561 
61621 
61681 
61741 
61801 
61861 



tttgagacag 
ctgcaacctc 
gactacaggg 
ttcaccgtgt 
ctcccaaagt 
aacccacatt 
caggtataac 
tgggtaaaga 
gtcagcacag 
ggttgtactg 
atttgggaac 
tgttctcccc 
cacaaattac 
acttttatac 
tgaaccatgg 
taagctattt 
acttccttgg 
catgaaacac 
gtttttatga 
actacaaatc 
gcttacgtaa 
cgagcttcat 
actttgtcaa 
aactaggctt 
gaccacaccc 
gacagagttt 
tcaacacaac 
tgggattaca 
tttttYcatg 
gcctcccaaa 
ccttaattcg 
aagactatag 
attttaattg 
ctccctttct 
tgaagaccta 
gtggttactt 
aatttcatca 
atcgttRcaa 
tactacagac 
gaactagtcc 
gaagctaatt 
aaatgtccca 
tctttatctt 
aattctttga 
tgcctttcat 
tattttattt 
actaattcaa 
atcccttata 
cccactcaca 
aggagcagta 
acaaggaatg 
cattcaacca 
ttatacattt 
cagtcccatt 
aagcatatgt 
atgtagtgac 
agcctagcct 
tattttttat 
ttaggttcaa 
catactcagt 
ctacaaaacc 
cataactact 
ttccactcga 
tatcaatcat 



agtctcgctc 
cgcttcccag 
gcgtgccacc 
ctactaaaga 
tcttggatga 
cctgtgcctc 
tgactgggac 
aagacagatt 
caaagaaggg 
gaggctacct 
aggatgtcat 
cacctggggc 
tccttatcag 
aaaagataac 
atgcatggat 
tcactgtatt 
gttattctga 
tgtctggttt 
cctgagaagc 
tcaaatgact 
aatccttgac 
aatcactctg 
agttaacttg 
ccaagatgct 
atgtggtcat 
cactcttgac 
ctccgcctac 
ggcatgtgcc 
ttgatcaggc 
gtgctgggat 
ttcattagta 
atgcccaaga 
ttctcccatc 
gtcaaaacta 
agttttgact 
gaagttgaca 
gaagatgtcc 
tcccagggca 
aatgcccaga 
ccctaaagca 
agcattaacc 
actaaacaca 
atttcaatta 
atcacagaaa 
tgccctaaca 
ctatcatcta 
ctcatattaa 
ctaatttcat 
ccaactactc 
attactggtt 
cctatttaac 
atagccttac 
aactggaggt 
tattaccttg 
ttttacaatt 
atgatcttgg 
ctggagtagc 
tttttggtta 
gcaatctacc 
ggtttttaca 
caagcatata 
gtaacacttg 
actcaattac 
aatgagacaa 



tgtcgcccag 
gttcaagtga 
acacccggct 
tggtctccat 
caggcgtgag 
ttataacgtt 
caatattgtg 
tattagagaa 
acgtctgcaa 
gtggaatgag 
gctagttggt 
cccttcctcg 
attttagcca 
ctcctgagtg 
tccccctgtt 
cttatctttg 
aagcctctgc 
attagttcat 
ctttgcttca 
ctacggttgc 
gagaaaggaa 
actttctcaa 
taggttaaaa 
atgtcaccta 
tttcttttct 
tcttgttgcc 
cggcttcaag 
accacgcccg 
tggtcttgaa 
tacaggcgtg 
ctttatatta 
agtggagaca 
attatgtgtt 
tagggcatca 
cctacataat 
gttgagtagt 
tatatttgtg 
tttaaaccaa 
aatctgcaga 
cttcgaaaat 
ttttatgtta 
tctacattat 
aaattctcaa 
cttgaaattc 
gtaatacgaa 
gtcacctgat 
aacaaatgat 
gtattctgtt 
aactctcaat 
tttgtcacaa 
ttctctcttt 
ccatatgatt 
gccatcttag 
attttactag 
tttttttgag 
Yttactacag 
tgggactaca 
gatggggttt 
tgccttggcc 
cttctagtga 
atagagtaag 
ttgaaaggca 
tctattaacc 
tatcagagaa 



gctggagtgc 
ttctcctgcc 
aattttttgt 
ctcctgacct 
ctagcacgcc 
tttaccaaaa 
gacagtaaaa 
agtacaaaga 
agtgtcaggg 
gtgcagctgg 
tgtctgtgat 
ttgttgctta 
ggacaaacag 
aaaccaataa 
ctcaggctac 
caggtgtaaa 
agtactctga 
ttatctcact 
aaagaagaag 
ccaccaccat 
ggaatcaaac 
ttaaaatgta 
ttatttatct 
tcatagaagg 
ctgttttttt 
caggttggag 
cgattctcct 
gctgattttg 
ctcccgacct 
agccaccacc 
cttccctgat 
atatgaacta 
ctaatacata 
atgatattga 
tcccacaaca 
tctccctata 
agccatacca 
gcaaccttaa 
tccaaccatg 
tggtcagcct 
aagaccgaga 
ttactatggt 
aatttatctc 
cttgagaaat 
gacccgtagt 
taacaactgg 
aataatacat 
tattggctca 
aaatttaggt 
aacaaaagca 
tctagtcatc 
gacagccaat 
tatcaacatc 
ccgttcttga 
acagagtcta 
cctccaactc 
gtttcatgtc 
tatcaatgtg 
tcccaaagtg 
gcctctactc 
cttcagccct 
ctcattacaa 
ctaggcctag 
ggcacattcc 



agtggcacga 
ttagcctcct 
atttttagta 
cgtgatccac 
cggctaatta 
acacatccta 
taatttaccc 
cacattgcaa 
gctgaaggga 
ggctacatgc 
cagctgtctc 
cttatcttat 
ctgatatttc 
cgcttaacta 
acactagaca 
ctcaaccttt 
ttatcccgat 
aacagcaatc 
ttttagtagt 
accacacatt 
ctcctaaaac 
ttagtaaaaa 
ctttatggca 
aagaactcct 
tttttttttt 
agcaatggca 
acttcagcct 
tatttttatt 
taggtgatcc 
gtgcctggcc 
actaacaaat 
tgttgcttgc 
atagatgaaa 
agctacaaat 
gacttaaaat 
gaaatattaa 
tcacatcaat 
catctatacg 
gctttatacc 
ctgtatcata 
gccaggattt 
tttatgtata 
cccaataagc 
Raaatgaatg 
tataatctcc 
ctaatctcca 
aataccaatg 
actaatttac 
atagctattc 
tcccttgctt 
gtcaaaacca 
attacaactg 
tattagccta 
atttgccatg 
gctctatcac 
ctgggctcaa 
accatgtctg 
cccaggctgg 
ggattatagg 
acatgatagt 
tggccattca 
catctgtctt 
taactaatac 
atggccacca 



tctcggctca 
gagtagctgg 
gagacgtggt 
ccaccttggc 
cgtttaacaa 
ccttttgtca 
aagacagtca 
aagtgcaatg 
agtttcatag 
ggagtgaggt 
tcagaacaat 
taggactcca 
tggcttttta 
aggtcattac 
gaaagtgagc 
tccccacggc 
gtatatacta 
atactaatac 
tgaagtatca 
tgaagaacca 
tggttccaaa 
taattacata 
tatccattta 
tcattttcat 
ttttttttga 
cgatcttggc 
cccacgcagc 
agagacagga 
gcccgcctcg 
gtgatcattt 
taacttgcac 
tagtatctta 
ttaataatct 
atacagatta 
caagagagct 
ttcatatgtt 
aggcctaaaa 
accagacctt 
cattgttctt 
atatcactgt 
ctccacatta 
atcatatcac 
tcaacactaa 
gaaatgtatt 
tatttccaag 
ttcaacagcg 
gatgaagctg 
tRggaggctt 
cagtatgagc 
atttcctacc 
ttagcctttt 
taaatgtaaa 
cccacagcct 
gttttaatcc 
tcatgctgaa 
gcagtcctcc 
actatttttt 
tctcaaactc 
tgtgagacac 
actttacgac 
caggggcgct 
agccatatga 
attaacaatg 
cttgtcaatt 
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61921 
61981 
62041 
62101 
62161 
62221 
62281 
62341 
62401 
62461 
62521 
62581 
62641 
62701 
62761 
62821 
62881 
62941 
63001 
63061 
63121 
63181 
63241 
63301 
63361 
63421 
63481 
63541 
63601 
63661 
63721 
63781 
63841 
63901 
63961 
64021 
64081 
64141 
64201 
64261 
64321 
64381 
64441 
64501 
64561 
64621 
64681 
64741 
64801 
64861 
64921 
64981 
65041 
65101 
65161 
65221 
65281 
65341 
65401 
65461 
65521 
65581 
65641 
65701 



gtccgaaagg 
gctggattct 
tccaacaggt 
ctcagctttg 
acacatactc 
agcctcagaa 
ctctatcgct 
ctgccttgtg 
ctgctctaca 
agtactgttc 
caacacacca 
ttaaaccaga 
atttcatttg 
aattccacat 
ctaatgaact 
caacacttca 
tctgaattct 
acctctttat 
ttataccaat 
tcattaagtg 
ccaggaagaa 
tccctgattt 
aatccacata 
ttaacatgct 
aatcttcacc 
tcatcaaaca 
cttaatattg 
ttctcagcaa 
cagcagcctg 
ttagactgca 
tccacttgtt 
cttgcagaag 
atggtgggag 
gtcaacactt 
aaatgcacgt 
cctgccccct 
agcccaaaaa 
ttgtaatctc 
acatgattgt 
agttactaaa 
gagaaaattg 
tgcctaggtt 
ggttttaata 
tctaatttca 
ccacgacccc 
actaagctat 
attaacccaa 
ctaaacttta 
attttagtat 
gcttagccat 
atagcttaag 
tattcaataa 
aacaatctca 
gtgggtcatc 
catcatgacc 
catttgaccR 
aatttttttt 
ctcggcttac 
gtagctggga 
tggggtttct 
gcatcgacct 
aatgttgaat 
ttataataaa 
caaaaatcta 



gccttagata 
tctgagcatt 
atttacctcc 
ggagtatcaa 
caaaccttat 
tgcttccagg 
acagacttcc 
catcaacgaa 
atcaaagaca 
tcaactgacc 
cccgacatta 
agggtgtctt 
ccaatgcaat 
taaaccagaa 
aggaggcatg 
catatctaaa 
agtagcagac 
tattatgagc 
tactagccta 
actctggtct 
gcttcttgct 
ttgtacttga 
aattactgtt 
agtactagat 
tccatgaata 
ttctgcagaa 
cataggacWt 
cacgcttatc 
cttggaaagt 
actatacctg 
cctcttaaat 
tactgacatg 
cctgaaccca 
acaaccacaa 
tcaagctcac 
tttccctgga 
caagatatgt 
aactacaatt 
aaaataaaaa 
tattgcccta 
tctagatccg 
agttcatgta 
agattacaca 
aaaggagcag 
acgggaaaca 
actaaactat 
actcacaaaa 
aaccgagccR 
cctgagaaca 
aaacttaaat 
cctcaaagga 
gctccgatat 
caggaagtca 
cactttatca 
cctgtccata 
gcagaatgtg 
ttgagacaaa 
tgcaacctct 
ttacaggtgc 
ccatgttgac 
cccaaactgc 
atgctgcagg 
tgtcctaact 
tacaattaat 



tggaataatt 
caagcctggt 
ttaacccctt 
ttacctgagc 
ccattacgat 
cacctttcac 
atggacttca 
aatttcactc 
ccttggatct 
tattaggaga 
aaccagaatg 
ctatttgcct 
tccacatgaa 
tggtactttt 
ctggcccttg 
caacaaggca 
cttcttacac 
caagcagcat 
atcgaaaacg 
gtaaaccaaa 
ccacaaccag 
agtgcaatac 
gcaactatgt 
gtgcctgatc 
tttagcagga 
agcgattaac 
tcagttattt 
acctcctaaa 
tctatccttc 
gcatctggtt 
aagacacctc 
tatttggtag 
gccaactcca 
ggtgctaatt 
cctttctcca 
ctaacctaat 
ataacatcaa 
acattttagc 
ccaggctatc 
ataNctagca 
aattaatgta 
actccatRaa 
tgtaagcatc 
gtactaagtg 
gcagtgataa 
tagggttggt 
acaagcattc 
tgaaaagctc 
tgatagctaa 
aatttaataa 
catggcgatg 
acctcagcat 
ccccatctat 
tctcttattc 
atgtaattca 
atttagagct 
gtctcactct 
gcctctgggc 

3C3CC3C 

tgggctggct 
tgagattaca 
ttcattggtt 
actattattt 
tccattacca 



ttatttatcc 
cccaatccca 
cgaagtgccc 
ccatcatagt 
tgccctggat 
cacctctcat 
cgtcatcaat 
atcagactYt 
aattctcctt 
ctcagacaat 
gtgectttea 
atgeaattec 
accagaatgg 
catttgecta 
tcttctccat 
taatattctg 
ttacatgaat 
ccattgtgta 
agctactcaa 
aatggaggac 
cacccaaagc 
atttatttta 
acatcatgea 
gtatattgta 
gtaataatgt 
atacatatat 
atgaatcata 
aacattctta 
ttactcccgg 
cttccttcag 
aatgatttag 
tttttaaatt 
ttgtagctgg 
tattcatget 
aacatgtatc 
tatgecaaaa 
tgacagccag 
caaactttca 
taggctaaaa 
tagtacttca 
gcttatgata 
cacaaaggca 
tggattccag 
cacagcccta 
aaaaaaaaaa 
aaatttcatg 
agcgtgttta 
aagctaaaat 
ggcccaaact 
acagaattat 
ctttacagct 
ctcttggtaa 
taattattct 
ttgeacaata 
tctcaacact 
agtctcaggc 
gtcacccagg 
teaagegatt 
cctagctgat 
ctggaactcc 
ggagtgagcc 
gtttcttcat 
ttccaggagt 
aactctccta 



catcagaagt 
gaacteggag 
ctactcaaca 
ttgaaagaag 
atttacttta 
ggcatatatg 
ggctcaacct 
tgecaaatte 
ctaatattac 
tacactccag 
tttgcctatg 
acattaaacc 
caactttcat 
tgcaattcta 
tcttatccta 
accattaagY 
egggggecaa 
cttttccctt 
atgaagaagt 
ccggctcctt 
tgaaattcta 
actgetatgt 
ttcctgcttg 
tatttatgta 
cacagcaacc 
atcatccagt 
gcacattcaa 
actaacaagc 
cccatgacac 
ggccatggag 
tgacaatcac 
gggagatget 
agtttaatgg 
tgacagatat 
tgatttactc 
aaaattgttt 
ggctcacact 
attaaaccta 
atcccacttt 
acccttcatt 
ttaaaacaag 
tagttctggt 
tgaaaacgee 
catcataaca 
aagcaataaa 
ccagctaccg 
aggcaattct 
aaaaagaaac 
gggactaaat 
ttaccagagc 
ctcgagaggg 
ctcagcctgt 
tttatcagtc 
atetgeatga 
agcagaaact 
ttcagtatag 
ctggagtgca 
ctcgagcctc 
ttttgtattt 
tgacctcatg 
attgcgcccg 
agcagaatat 
actacatgat 
ctgagaaccc 



agttttctct 
gctgctgacc 
tatccattct 
aaaaccaaaa 
cactcttcca 
gctcaacatt 
tcctaaccat 
cattgeacat 
tactcatatc 
caaatctcct 
caattctaca 
acagtagtgt 
ttgcctatgt 
cactccattc 
gctgttgttc 
caatgtctRt 
cctgttgaac 
atccttcttt 
ccttgtagta 
gggaccattt 
cttaaactat 
cagattggaa 
accccatgaa 
tegtacctta 
tgtcatacat 
ataataatcc 
gtcaaatcat 
tttgagaaat 
ttgggggtaa 
actaagatcg 
ctcatggtca 
atgactcagc 
aatattacga 
catgaaaaga 
ctcaaactcc 
gttcttgeca 
ccaaatcaaa 
tttcaaaaac 
ttatttctaa 
ttgtaacaga 
gcactgaaaa 
ctttttattg 
ctccagatca 
ccttgctcaa 
tgaaagtttg 
gagtcataca 
cataataaag 
tacaacaatg 
ctctcactgt 
actataagca 
gcctgttcta 
atactgetta 
cccttgataa 
ttgattttac 
aaccgagccc 
ttatttattt 
atggtgcgat 
agcctcctga 
ttaatggaga 
tgatccacct 
gcccggcttc 
actaacatta 
gtctttatag 
tgtttctgtg 
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65761 
65821 
65881 
65941 
66001 
66061 
66121 
66181 
66241 
66301 
66361 
66421 
66481 
66541 
66601 
66661 
66721 
66781 
66841 
66901 
66961 
67021 
67081 
67141 
67201 
67261 
67321 
67381 
67441 
67501 
67561 
67621 
67681 
67741 
67801 
67861 
67921 
67981 
68041 
68101 
68161 
68221 
68281 
68341 
68401 
68461 
68521 
68581 
68641 
68701 
68761 
68821 
68881 
68941 
69001 
69061 
69121 
69181 
69241 
69301 
69361 
69421 
69481 
69541 



aatcagggca 
actttctatt 
ctagtattca 
aattacagag 
aagtcaaaat 
taagggcagc 
tatgctRgtt 
cactattttt 
ggattagaaa 
ggtggctcac 
ccaggaggtc 
agtgactcat 
ccatcctggg 
ccccagtggg 
ctggaaatca 
ttcgtgacac 
ctgaaggcct 
ccgctctggg 
tcgatgtatg 
gggttctgaa 
ttggcccacc 
tcccacaatg 
ccaggagaca 
ttttgtcacc 
aacctcccaa 
ggtgtcagta 
tccatgtgtt 
aggtggatct 
actagtggac 
ttgtgccacc 
aggcagtctt 
tatacatggc 
gttgagactg 
tcaggtcagt 
ctgttgcttg 
tgtccagtcc 
gccaaatgag 
tttactgtct 
ctcttcccag 
tgaggcctga 
ggtcttccct 
gtgtgtgtct 
ctgtgagctt 
attttaaaaa 
tgcgctctcc 
ggttgcagat 
ggagggcaga 
atcttgtgga 
atgcctggct 
ggctgacttt 
ctcgccagat 
cctcccctgc 
tggagctcag 
ggaagaagaa 
cctgtctcct 
tctctcccct 
gaaggaccct 
agatccttat 
aagtcccagg 
gcatcaagaa 
gtctaagggc 
gtctgctcct 
gactgaactg 
ctccaaagag 



tcagacctat 
ccttactcta 
gccacaaaca 
gtttaaatct 
tctccatgct 
ttaccctatg 
tatacccttc 
atagaaactg 
taaatatgct 
acctataatc 
gagaccagcc 
tcaaagtttt 
acagtgctca 
ttacatgagc 
tggcccatga 
taagagcggt 
gtagtagcct 
gcgggggggc 
tgacccggtg 
cagctgctag 
tcatggctgc 
ggctccagca 
gcttaggact 
tgacgtggat 
ctgtaagcag 
tccatttgca 
tgcctggcag 
attgccatgt 
agggcatccc 
ccgggaggcg 
tactgggaca 
caatagctgc 
aaaccctaca 
ggcgtctcta 
agcttgtttc 
catttgtttc 
gaatgaatat 
gaggaacagc 
gaaattggag 
agaggagctg 
aaacctggct 
gtccatacat 
tctctctgga 
gctttcgctt 
ttgcttcNaa 
cacaggccct 
gcaacagtga 
aaggggcaag 
ttgacatttt 
cctatgaaaa 
cttctaagcg 
tgtaacaaag 
aagtcctaaa 
gccattccat 
tccttcctct 
tctatcatct 
tgtggttaca 
ctggacgata 
gttaggacat 
aattacgatg 
ttttctgagg 
gccccactct 
cagaatggaa 
ccggagacca 



gagtttgata 
gccctatgca 
tgggaaatat 
tcctatttct 
atctgataca 
gtaatggcag 
ctgtactaat 
taattatagt 
agtcataatc 
tcaacacttt 
tcggcaacat 
catggtttca 
tatgtgggtc 
tgtggaacct 
aagggggccc 
gagacRcttc 
cattgttgta 
caataaaatc 
tggagggaac 
attatctgca 
cccctggtgg 
aaccatggga 
atgtgcatca 
attctgccac 
cccattgact 
ccgtcagcac 
atatcgtaag 
ccRctctttc 
tttgttgcag 
ggcttggcta 
gtctgctttc 
tgctccagca 
ggtaccattc 
gtgacttcca 
acccatattt 
acctttcttt 
cctccagtag 
atggatgcgg 
gacctgaagg 
gtttcccact 
ttaaaattgc 
gtgtgtgtgc 
ttcttccaat 
cacttttgtt 
aaaaaaatgt 
cagatgattg 
gccaccccca 
gaacgctggS 
tcctgaaggc 
gcaaattctg 
cacatgaggg 
caccacagac 
atcaaggtgt 
tgcctcttcc 
tcaaagtcag 
catctccttt 
ttggacccac 
actgcaaggt 
gcacatctgt 
gggatatcaa 
tacaggcccc 
ctgtttctag 
aggagtgagg 
tttgcagggt 



tgatcaactt 
tacagaatgt 
gtctgataag 
agaattacag 
ccatgtccta 
ctaaataagt 
cagttctcta 
aatcgcctta 
cccataaaag 
gggtggctga 
tcatagggag 
gggagattct 
tttagcaggg 
ccagtctatg 
tgaYctctat 
atgStctcca 
tccacacagg 
gactcgccag 
ctttctagtc 
cagtgggcag 
tcgctcttat 
acatgcttaa 
acRtggtaca 
atggacaggc 
gctttgtcgt 
tgaccgtaag 
ggtcgcctca 
aagcacgtcc 
agaggcatgg 
ggagaccttc 
tggcgatggc 
ctgtaaagtg 
ctgtgggttg 
gtacaagagg 
tgttttgtca 
attagagtgt 
cccagtaaac 
gcttgatggt 
aaaaccctta 
gctgacaccc 
ctgtgtgtgt 
acacccatag 
acatgtctct 
cttgcatttg 
taactgatac 
gcatatttgg 
accccagcat 
acctgaagca 
gtctgataaa 
taggtttctg 
ccctgatagg 
acagtggctc 
gggcagagct 
agctcttgga 
ccttgtagca 
tctccctctc 
ttaaagaatc 
cccttttatg 
agggggctgt 
ggagtttgaa 
gctagaggaa 
acagcctgcg 
cgctgcagca 
ggggtctgag 



atgcatcttc 
ttaaatatct 
agagttactt 
gaattgaaac 
tagtaatagc 
tatcaggccc 
atcttcctca 
cactgtctct 
gccaaaccca 
ggtgggtgga 
tgaggccatg 
tagtctagga 
cactcttcca 
gattgttctg 
ctgtcctttt 
gttgggtggc 
ttaagtcata 
tcccgcacag 
gcagccatga 
ggaagcctac 
aaggtagcca 
gggcgcctgc 
ccataagatg 
cctcaaaggg 
tacccccatt 
tgttaggtgc 
tatatggtag 
tgtagtactt 
catttctgta 
cattcaccct 
ctctgctcRt 
agattctgct 
catttgcgac 
gtgctgtggt 
gaagcctctt 
ataatgggca 
ctaggaaaac 
cccctggggc 
gaggcagcag 
ctgaggcttt 
gtgtgtgtgt 
atttctttat 
actgcagagt 
cttttgactt 
aaacattgtc 
aataagtttt 
tccggggagc 
ggataggtca 
ttgtggctgt 
gttacatgat 
gagagagtgg 
aaaccaacac 
gcgttccttc 
ggaaactgct 
tcctccgatc 
accctactgt 
ccaaataatc 
atggaaggtg 
tattcagcct 
gaatcccaac 
gctgctccaa 
ttcaagtcat 
gaaggagctc 
ggaggttggc 



ccgtgaaaaa 
gtcctaatat 
tgacagagta 
tagtcctaag 
agctaataag 
atacccaaaa 
ctatcctatt 
taatctgaaa 
ggccgggaat 
ctgcttaagg 
cttggtagtg 
cccccaagag 
ctgaggtctg 
atgcccagtg 
agccatgcca 
acgttttaca 
ccgtttccct 
gttgtatggc 
acaggtcccg 
agtctttgct 
catggcagcc 
ctcctgattt 
cacatcattc 
gcctgcccca 
ctatccRtat 
tcttgctagt 
ttggcatggt 
gacttaaagg 
ggatgtgtgg 
tgtataaggt 
tgcaatgtcc 
cccttccaca 
aaactccaac 
catgaggctc 
gctccatatc 
aagggattgt 
ctgcagttgc 
ccatcggagt 
gcagtggagc 
ccggatccct 
gtgtgtgtgt 
ttatttaatt 
tagccagagg 
tgttttgctt 
ccaaagagtt 
caggctgaga 
acctgctgtc 
cttcagacaa 
tgatattttt 
aaatttcgtc 
tgtgttagct 
gaagttcttc 
tggaggctct 
ggccttggct 
tctcccagtc 
ctctgcctta 
tcccatctcc 
atataatcgc 
aaaacaagtt 
tcgtagccat 
ctctgttcct 
tgtggcccaa 
gctgtcaaag 
aaggaaaggg 
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69601 
69661 
69721 
69781 
69841 
69901 
69961 
70021 
70081 
70141 
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70321 
70381 
70441 
70501 
70561 
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70741 
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70921 
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71041 
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71161 
71221 
71281 
71341 
71401 
71461 
71521 
71581 
71641 
71701 
71761 
71821 
71881 
71941 
72001 
72061 
72121 
72181 
72241 
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72541 
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72661 
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72781 
72841 
72901 
72961 
73021 
73081 
73141 
73201 
73261 
73321 
73381 



gagcatcatg 
tggcttagat 
cctctctatg 
tgtataggag 
ctgccagggc 
gagctctcta 
aagaggcaaa 
tcaRgaagca 
aacagagctg 
tgtcacaggt 
tagcacagac 
atttcccata 
tcccctcaga 
tccagactga 
gtcagagggg 
caataagtct 
attcttggaa 
acgccggtgg 
caaattcatg 
cagtgattcc 
caactgtttg 
cctaaggtgt 
atcaggcttc 
gaaattcact 
ggaagataaa 
gctttgtgag 
aggtgcagga 
ctgagaatgg 
tgtggaagca 
gggtagggag 
tctctatgac 
gagttccctg 
attgtacaag 
cagaccatga 
aattctgctg 
cacctctcag 
gaaggagMRg 
gctgtaagtt 
ggaagaggta 
aaaactaccc 
atacagggaa 
attttcagaa 
acggatccct 
cacagcctta 
gaactgcctt 
tgatagccag 
ggttacaaac 
tttaagtcta 
ggtttcagag 
tcactgggtt 
acagcaggac 
gttgtaaagg 
gaccccacaa 
cttgtgacgg 
cctacaaata 
ttgacagtgt 
ggtgtgtgtg 
tggattcata 
tcaccaggct 
caagcaattc 
cctggctaat 
ttgaactcct 
gcgtRagcca 
gtgtgtgtgg 



tggattcctt 
tctaattgtg 
acaccaaaag 
accggagggt 
ccaggagatt 
gaatggggga 
ccctagggtg 
gggagggcgc 
gtggacagtc 
catcatgctg 
ccagagccct 
agtgcagcct 
tcacagggga 
caccaattcc 
gggttacagg 
atcacctggt 
gcttcagatt 
aagccgttgg 
gaggagttag 
tgtgactcac 
tcttccaaca 
tcttgttaat 
ttggtcaagc 
ggaatcatat 
aagaagccaa 
gagtgacacg 
ctcggtcgcc 
gttctaagca 
gccaattagc 
tgggagaatc 
tacacatgct 
ctgtgaaaca 
gcagacccct 
gggctgaatg 
tgctttcctc 
ggatggttcc 
tgtcctcggg 
cagcatttta 
tacagataaa 
ctgtcccatg 
gaagtcactt 
tgggctccat 
cctcaaggct 
gagagcatgg 
taggatgtgc 
aatttatgga 
aataagtcac 
gaagtttcag 
gtgcccatag 
agatataatc 
atgaggaatc 
tgaataggaa 
gaatgacttc 
tggcagcaat 
tacacacaga 
gccgaggcct 
gctctctttg 
acaggtaaRg 
ggactgcagt 
tcctgcttca 
ttttgtattt 
gaactcaaat 
ctgtgcccag 
tagcaagcag 



tccatagtYg 
cccaggactg 
gccagaaatt 
tagactggat 
cccctttcct 
ggggctagga 
aggcaggtgg 
atgatctggg 
tactgacttc 
gatggcccct 
tgaaggaaag 
gtgagcgttc 
cggcttaaag 
tgggacccat 
gaacaagtaa 
tattattttt 
gttcacgtag 
ttttacatac 
agagattgct 
ctgttcacct 
gcagaacaaa 
gaagcaaata 
agaggtacca 
cagaagagcc 
gacagtgaca 
gccacagcgc 
ctcttctcta 
ccctgggctt 
cagcctagct 
tacttgcctt 
ggcaaacScc 
agtttgttcc 
aactctaatt 
caggtgctta 
ctggcccacc 
tgaagagcag 
aaggatctgc 
gaaggaatac 
gattttagaa 
tgaatactga 
ttgtggctgc 
aaccaggtgc 
gagcacttgg 
aaatggcccg 
tatgttttga 
gctggacagt 
tttcaaaaat 
tatctaaggt 
ctggcttctc 
atggatcatg 
gagggatccc 
atttcaccac 
gcccataagt 
ggtcagtggt 
aaatgctgag 
tcctcctgtg 
gagaagaagc 
agagtttttt 
ggcgtgatct 
gcctcctgag 
ttagtagaga 
gatccgcccg 
tcgaaaaggg 
ttgactatat 



tcatgagcta 
gccatgcaac 
acctggcatg 
ttctcaggcg 
ttgatgagaa 
ccctggtctc 
tggccagaag 
ctcagggacg 
gcaaaacccc 
gcttgtcact 
ggaggccagg 
ctcctacctg 
gaaacccatc 
aatgacccag 
tgaatgaata 
taaattcctg 
acccacggat 
ctgcctaaat 
gccatcatca 
cttcaatgca 
gcaggaagct 
aacacagcct 
tctctggttt 
ccacctgcag 
cggtatcaag 
tgccccgcca 
ttctcctcct 
tgcctcactt 
cttggacagg 
agaatcacag 
caaagtagga 
atctcttggt 
tgYgaaaact 
cactgagtta 
tgctcccacc 
tggagcacac 
cttgtttgct 
gtctggaggc 
tgagcccaga 
ctagaaagtc 
aagtctcctc 
gtgatgtgga 
catcgtccag 
aaggggactc 
attcataccc 
aagaggttaa 
cttgtttaca 
gggaatgcct 
atgctccctg 
atcatccagg 
ctaggattgc 
cttactgagg 
aagaggaaca 
ggttgtagcc 
ataggagctg 
acataacttc 
aagtgcattt 
gtttgttttt 
cagctcgctg 
tagctgggat 
cggggtttca 
ctgcaaccac 
ttctttttgt 
aMaataagca 



acacacttcg 
ccttgagtaa 
ataaagaaga 
ctttctgccc 
gaatcagtgg 
tttagggtgg 
aaaagagtgt 
ttccacatgc 
cagtctggtg 
caattctcag 
agtctgcaaa 
ttctccctgg 
cccaggactg 
agtgccactg 
tctgatttca 
ggtggataac 
gatgactatt 
attttaatca 
aggtttaaag 
gccagtcgtc 
catcagtRga 
gcagctctca 
ttaagggcag 
aggaggctgg 
gctgcgagca 
cctgcggaca 
gggcagtcct 
gctccagttt 
agccctggat 
agggaaatgg 
agaggaaaat 
tctaaatatt 
aaatagagat 
tgagtactca 
aggcctggcc 
ctcacagtcc 
gggtaacagt 
ttggtgacaa 
gcaaaaaaaa 
tccactctca 
tgttcccagg 
gctcggtgta 
tgaatctcaa 
agttccatgt 
aaaatttacc 
gatgagggtg 
atacccctaa 
cctgcaaggc 
atgtagggat 
gaagctcatg 
tccttgtaat 
ccagaccatc 
ttgagtggaa 
ttgaccgcac 
ccaatggacc 
agcatgcaga 
gtctttgtcg 
cgagacggag 
caagctctgc 
tacaggcatg 
tcatgttggc 
gcgaagtgct 
ttgtatgaga 
caaattcagg 



ccccaacagc 
atgctggcca 
aggagtcaag 
tgactgtgtg 
atattctcta 
gagaaattcc 
ggacaccagg 
agaatccagc 
agcaggcacc 
gcctgactca 
aggccccgtc 
acctggagcg 
ttggatccca 
tggtgttcca 
gtttactttt 
tggaatgcat 
tcagtgacag 
gagtcaacag 
ggagcagaag 
agcagggaaa 
agctgccatt 
gtctctatca 
aaaacaacat 
agtttggctt 
caagagccct 
tcatctctgc 
gctgctctca 
cagaatccca 
gccagagtgg 
gcccagcctc 
agatgcggaa 
gacatctttg 
acataactca 
acRtctctca 
tgctgtggaa 
aggctgcctg 
tgaggatttg 
agatatttag 
gaaagagaaa 
agagactcat 
gcggcccggt 
accacgcaac 
ccccggaagt 
cagaggctgg 
tcatttcctg 
atagctctat 
ctcagagact 
tgacaaaagt 
agcaatgacg 
ttgttactac 
actctaacaa 
aaggagctta 
cagtcaaaga 
ctctctgttt 
aagagaaaaa 
agagggcgat 
ttcggatggc 
tttcgctctt 
ctcccggatt 
tatcaccatg 
catgctggtc 
gggattacag 
agaagggtac 
tgactgtttg 
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73441 
73501 
73561 
73621 
73681 
73741 
73801 
73861 
73921 
73981 
74041 
74101 
74161 
74221 
74281 
74341 
74401 
74461 
74521 
74581 
74641 
74701 
74761 
74821 
74881 
74941 
75001 
75061 
75121 
75181 
75241 
75301 
75361 
75421 
75481 
75541 
75601 
75661 
75721 
75781 
75841 
75901 
75961 
76021 
76081 
76141 
76201 
76261 
76321 
76381 
76441 
76501 
76561 
76621 
76681 
76741 
76801 
76861 
76921 
76981 
77041 
77101 
77161 
77221 



gccagaggct 
gcctgactga 
gaaactttaa 
agccctgaaa 
aaattctgta 
aatcttctat 
tttaaRcaat 
caactctaag 
cagaaacatt 
tgaacaattg 
gctctgttga 
ctgatggaga 
taagtaggcc 
cattctgtat 
ctatgcatgg 
agtggttaca 
agactcctca 
catcagYctt 
tcacttccca 
ctcaggccac 
aataggaagc 
caacttggcc 
accattttgg 
gtttgcagac 
catggatttt 
tcaatcctgg 
tctacaagtt 
gcagtctgga 
agaatcatga 
caaaggctca 
gggccaggtg 
ctgaggctca 
caacattcct 
cgtttccaat 
gccccaagca 
cacctttacc 
gagccatgag 
aggacttaca 
taaactgctt 
ttacttggtt 
tcagctgggc 
gcacccttaa 
aagccactgg 
acagggYgag 
ggcgggtctt 
ttctccattt 
ctctcttcct 
cactctccag 
cgcacttgcc 
ctaacactga 
ttttgtcccg 
gggtatcggg 
ggccggtttg 
caggggagac 
gtttttatta 
acaaagatca 
gatcacagga 
tgtcattgat 
ttattaggcg 
tatttcatcc 
acccaccctc 
agcgatattt 
cactgttggt 
ttcttttttc 



tcccattagg 
ttggttcaag 
tcagagacac 
gggcaaacca 
gattttgttc 
tttgtttaag 
actagcagta 
tgtcagggca 
caaagcttaa 
acagacaaat 
cttcttcagg 
gttacaggga 
ctgggtgcac 
ctcttgcccc 
aaatacagat 
ggcctccctc 
ggaggtctcc 
cYRtccttct 
aacaaatcct 
aRcagcctac 
cattgaagac 
caaKgtggag 
aagcttttgc 
tggatgaggg 
cYYtcaacct 
gccctggctg 
ttccctgagt 
gagtaagcag 
ggaagagaag 
gtcatcagtc 
gatgtgagat 
gagagatggg 
tgacttgttc 
atagcatttg 
gggccgaccc 
actgttagga 
accaaggaat 
cacagggcac 
ttcagttaat 
ggttctcaga 
accatggaaa 
ataacctaaa 
ctgactcaga 
agaccatcac 
tagtggagcc 
tcagctcttt 
cccccaatat 
tcccctctcc 
atcttgagta 
gttccataaa 
gcctgtgtct 
gaacctgccc 
agaaataaag 
atcacacatc 
gggactttca 
cctacttcac 
acacaggacc 
aacatcttat 
ggaatttcct 
ctacagtctc 
aggggtgcat 
ctcccatttg 
cagagtttaa 
aaggtgccca 



gtttgtgtct 
aacagtcact 
taggaatcaa 
tatattcctg 
ttccttccat 
acagatagWt 
cagtggaatt 
aatgctggct 
gtgaaatgta 
gaaacacaat 
aagaaataga 
ctgaggcaag 
tttacgagtg 
tgctctgggc 
aattaatgtc 
tctcctccct 
agtcgcccac 
cYgtgtcact 
ctgcctggga 
attagacagc 
atggaagccK 
gaWgagctgg 
aggagtctgg 
tatttgctgt 
acatacaggc 
ctagatcagg 
gcttcagggg 
aaggggccat 
tcaggcaccc 
agtgttgctt 
atcatcttgt 
gtctcttgac 
ctggcatctg 
gagtttcatt 
catgtcaagt 
caaaaacttg 
gacttaaaca 
tcagcaagac 
tttctggctc 
tactctccag 
tggctcatta 
ggggacctgt 
tacaccccYg 
cagggaagga 
gctgtttctt 
cttttcctgg 
actctttagt 
tcccttattc 
acccctcccc 
ctttacctcg 
ggctcacctt 
cgatggtcac 
gcacagagta 
ggtaggttcc 
aaaggggagg 
aaggtaatag 
ggggagaaat 
caggagacag 
cttcctaata 
gaccatagaa 
tctctttctc 
cttttgaaag 
ggttatctct 
gatttcatat 



ctgttactca 
gacccaagct 
agtgtgtgga 
atgttggtaa 
cctctgacct 
tctgtttctg 
gcatcaaatg 
ggatcactga 
tgaaaagaat 
gttctggcgt 
tgaatcaact 
tgttctgcag 
acttaggaga 
tccagctgag 
cactggggca 
gtagaggaac 
atgggtgacc 
cctMccagta 
gccttgccta 
atgaaggctc 
gKgtttgatg 
cctagcggca 
gcgaagtgag 
ctggctgtct 
ccttctgatc 
accacgcctg 
ccactgggca 
ggaggcctag 
agagcctagg 
gaacacaagt 
ccagcccccc 
tgcaggctgg 
gttttgcttt 
gaaatgtcat 
cctcttgctg 
agacaggaac 
agtccagctt 
agctcgagag 
tttgtctact 
gatgtttgga 
cccggccttc 
cacactacaa 
ggaggacaag 
ttccattctt 
ctcctgtatc 
tcttctcatt 
ctagagtaaa 
caggccccag 
tgctgtgctc 
ccctctttat 
ccctggtggc 
gtaggttctt 
caaaagagag 
gtgatgcctc 
gagtgtatga 
aatatcacaa 
taaaattgct 
ggttttgaga 
agcctgggag 
gatggccaca 
agggatgttc 
aagagaaata 
cttattccct 
tgttcaaaca 



tcgctgttcc 
atcctcYcat 
gctaagccag 
tggtctgctc 
gctctagttt 
ttgcttgcaa 
caggtagcct 
agaagtaaat 
gggaattatc 
taggcaagtc 
gaaggtcagg 
gatgctacat 
ggttctgggg 
gcgcccagca 
aggccatggg 
attctgaaga 
ctatcaagaa 
ccccagtcct 
gggctctccc 
tgggcttttg 
aaatctgtgt 
agggctgtRt 
aaggctgact 
Stctctgtcc 
ttagcctcat 
tactagagcc 
ctgggttcgc 
ctccctgaga 
ggagcccata 
ctctagaaca 
tttcccagct 
ttcatctcct 
tatttttctt 
cggtccctca 
tctgctgaca 
ttgccctcaa 
ggcgagtagc 
atccRgcctc 
atatatgagc 
ttctcagaga 
agggttcaag 
ccaccaccac 
ggagagtgga 
ggaaggacat 
caacagttct 
gcttgttcct 
ctgcttcttt 
catttctgcc 
agctacagat 
tcccctaaaa 
tgcactgctc 
ttctattttc 
aaattttaaa 
acaagccgca 
atagggtgtg 
ggcaagtgga 
aatgaggttt 
gcaaccggtc 
cgctatggga 
cccaaggagg 
cttgctgaga 
tgactctgtt 
gaacaat'tgc 
cacatgctct 



tgataccata 
ttgagaatgt 
gtgcaggcag 
ctgtccttcc 
tcatccaatg 
ttgaaaaatt 
gatggcattt 
tactcagtgc 
caggaattga 
ggtggccgtt 
gaagccttgg 
aaaaaaggac 
tgattttgtg 
gttctatatt 
ggacaggagc 
cgctccttga 
tttctcccgg 
gctcctagaa 
ttcaggggaa 
tgctaagagt 
ttttatgggt 
atcagcgaag 
ggatgagggt 
ctctatgtct 
gtgacccatg 
aagccagaag 
agagaaatga 
gcctggaaga 
tatcaRcaca 
caagagaaga 
gatgaggaaa 
caatgcacca 
cacacatcaa 
gtgtacgctg 
tcaccacaat 
agcgggaaag 
taagtttatt 
ccccaatgtc 
aacgagactt 
cacctggtcc 
caggggacat 
catcatcaag 
tgctggtaag 
ccaaccgggg 
aactgtggct 
acgtgcctcc 
ccattcccca 
ttctccttgg 
gccaaagttc 
aatgtctgcg 
tggggtcatt 
cctaagtgtt 
gctaggtgtc 
aaaccagcaa 
ggtgtgggtc 
ggcagggcga 
cgggcaccat 
tgaccaaaat 
gactggggtt 
ccatttatag 
aaaagaattc 
ccaccaggct 
tgttatcctg 
acaatttgtg 
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77281 
77341 
77401 
77461 
77521 
77581 
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77701 
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77821 
77881 
77941 
78001 
78061 
78121 
78181 
78241 
78301 
78361 
78421 
78481 
78541 
78601 
78661 
78721 
78781 
78841 
78901 
78961 
79021 
79081 
79141 
79201 
79261 
79321 
79381 
79441 
79501 
79561 
79621 
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79741 
79801 
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80521 
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80821 
80881 
80941 
81001 
81061 



cagttaacgc 
taggagatta 
cagtgtccat 
aattataaaa 
ttcttctgcc 
tgggcctagc 
agctcctgag 
tccagtgagt 
ttgcccaggt 
gtctcaacaa 
ttagatctcc 
agtgcaaaRg 
tggtcagtgc 
ttctccacga 
gctgcagtaa 
gacaggatgc 
tcaatctcat 
tgaggaaacN 
ctcaatagcc 
cctgtaccaa 
caataagatc 
ttctaaaaga 
gcactttggg 
aacacggtga 
tgcctgtagt 
ggagcttgca 
tccatctcaa 
catagatatg 
tgctctttag 
atcactacca 
agagcaaaaa 
gagagcatat 
tgcctggtag 
gatgggtgat 
actaaagggg 
tgtgatccag 
gccaaacttc 
catgcccatc 
tcaatttaga 
cttcacccat 
ccagtttagc 
ccttaccctg 
agtctccctc 
tcaYaggaat 
gttccacatt 
ccctctcaga 
ccacaatcgt 
acctgagggg 
actcgaacgt 
tagaaggtgt 
ctgactaaag 
ctattagtcc 
tccaaacttc 
cagcttcatg 
atgaggactt 
aagatggtag 
ttcaaattac 
tgcctctact 
cccaggcatg 
tcaaatcagg 
ccatgttaca 
ctccagcctt 
ctctgcccct 
cctttgacct 



aattatcaca 
aagtaaagac 
gaaatcttca 
gtattaattt 
atggcttcag 
ctgcttttag 
cccactgcct 
cttgcacaag 
tccgggctac 
catttatagg 
tgagtgtatg 
aaagcacatt 
tggttgcagt 
tgctggaggc 
tgctcaggct 
cagtggagcc 
ttgcaatgac 
ttcaaaaacc 
acatatggtg 
taaaggacag 
tattctgcat 
taaataattc 
aggccgagat 
aaccccgtct 
cccagctact 
gtgagccgag 
aaaaaaataa 
ccacagtact 
tggctatgtg 
gggtgtgtgg 
gaccttcatt 
ccctttgtcc 
gaacgtcttt 
gggcgtttag 
ttagaccttg 
ccccaggaag 
agttgggcgg 
ccgtgcttgc 
gaggatcccc 
tcatttgata 
aggaaccccc 
gtgttggctg 
ccctgtaaca 
ttttccttaa 
gacacaggag 
ctctgccctg 
gcaataacag 
ggcttaagaa 
tgcagttcac 
caatttgcct 
gccactgggt 
tggctgcaca 
agggtctcac 
tgtatacagg 
tgaaaatcat 
agaaagtgtt 
accagttgaa 
ctatgtagaa 
gtgtcgggga 
gagggaagaa 
tttcatgctc 
cctgctgctc 
ggcaatgtct 
ggagttcatt 



tggtcctgag 
aggcataaga 
caatttgtgt 
ggggaactaa 
ccagtccctc 
actctgatgg 
cagcgactca 
gttgactaca 
ttttctcact 
tacatatcgc 
gacatcattc 
gggtgtgatg 
cagcctgctg 
gatcccagcc 
cagccctgct 
agacaccaca 
acgaagcctt 
actccctaaa 
taagattctt 
atgattaaga 
aaatcacaga 
ttttcaaggc 
gggtggatca 
ctactaaaaa 
tgggaggctg 
atcgcgccac 
taataataat 
tattcagtat 
tttgataaag 
agaggggaaa 
cagaggatca 
tcggcccctg 
gtttccctgg 
ggctattagg 
gggaagggct 
agctctgcat 
ttctgagcct 
tgggcctgca 
ccagcttgct 
agcgactgtc 
cactctggtg 
tgactccccc 
acaatcctga 
ctgggtgagc 
aacacagcct 
tatgtctctt 
ttatcagtga 
agcccctcaa 
ctaaatctga 
ggcatgttct 
ccagccactg 
caaactctca 
tctcacatgt 
tcaaggccct 
ctctataaat 
tttctctttg 
gccactcgtc 
cagaagaagg 
aacagtcacc 
atatccctct 
tcagtgtctc 
ctccaatcta 
ctctgggtct 
ctaaggcctt 



gtgacataca 
aatcacaagg 
ttagagattg 
taaatgtcca 
cgtttggggt 
atatgggtga 
gcagactcgg 
tccagcacaa 
atccgggtgg 
caagcaatca 
gcaatcattt 
tcaYgcagaa 
gctgtgagtt 
gtggcagatg 
gtaaatggtg 
ttggcgatga 
tctatggact 
ctgctgttct 
cagagcttca 
aaggcagctt 
gctgtttcta 
cgggYgcggt 
caaggtcagg 
tacaaaaaat 
aggcaggaga 
tgcactccag 
tctaYtcaag 
gggaaaaaat 
atggctcctc 
ctttgtagaa 
tagaacagtg 
gaaagggatc 
tggtttggct 
tgtctcatct 
ggagactcca 
tgtcatgagt 
cttctcctct 
ccgccccagc 
gtctgagcac 
tgacctgcct 
tctcctgtca 
tctgacttta 
ataaaggctt 
ttccctggtt 
cctgaggaca 
ccttaatctg 
gttacgtgag 
gtttgcaact 
gtccttaaga 
ctcttccaat 
ccacctatga 
agccacctta 
caccagccaa 
gtacaatttc 
cactaatgct 
gttggggact 
cccactctgg 
tttcaaatgg 
catggcccac 
cctactcacg 
ccttctctca 
ggtccacccc 
ttgggcggca 
caccacactc 



tcctcctcag 
gtattgattg 
cggtaaagac 
tgaaatcttc 
ccctgacttc 
gctcattgag 
attttgcccc 
caaggacacc 
gggccccacg 
aagggcgtcc 
ttgtggcttc 
tgtcccttaa 
ctgctgacct 
ctattcccag 
ccaacaYaac 
cgcagcctct 
cctgaatctt 
ttttgctgca 
tagagagcat 
acttacctgt 
taagtcaaat 
ggctcatgcY 
agatcgagac 
tagccaggYg 
atggcatgaa 
cctgggcgac 
gttaaatatg 
aagttaaagt 
atgcaaggga 
aaacaaagag 
atgaattaac 
tctggggccc 
acaggccagt 
gccttccaga 
tgtctgccat 
taagaaagac 
agacattgac 
tcagcaagaa 
acttgctctc 
ttagggagca 
gtcctgttcc 
ctgtattggg 
ccttcctgat 
ggcaacaccc 
atgaagcttc 
tgttctttcc 
tccttgtagt 
ggtgtcagaa 
attcaaacac 
gcctcatgcc 
ctgaaaagtg 
ctgcctcctg 
actaatgcct 
atggaagctc 
gaaaagacct 
tgtgagtctt 
aagttggcct 
cagcccagaa 
tatgcccatc 
ggtcttctgc 
ccctcctccc 
tgccccatcc 
tcatgctatt 
ccatctgatt 



ctgacaggat 
gggaagtgat 
aggcatagga 
acaatccacg 
ccgcaacaat 
attctcctcc 
cttgtgcaag 
tgaagtggcc 
tgttctcagt 
aacagtggct 
gtcaaaatca 
tgcctccaat 
tgtgtatgtg 
ccctacccca 
gccaatgaca 
gtggaccttt 
ccatctgatt 
tgtctttgtc 
ctgcYtcttc 
aaaatgagct 
ggaagtaaag 
tgtaatccca 
catcctagct 
tggtggtgag 
tccaggaggt 
agagcgagac 
ttttgtacaa 
catcttccga 
ggtcaatgcc 
aagtgggttc 
tttattagaa 
tggaatgtcc 
atagcaatgt 
ggaactgagg 
gacggcagct 
ttactccttg 
cttggccttc 
tcccccaaag 
tgatcacggt 
tcctgttagg 
atctgccgcc 
agctgggctc 
ttaatgtgat 
catgtgtttt 
acaactgaaa 
ctgtgataaa 
gaattatcca 
gtaagggtgc 
aacctcagac 
ctttgacgtt 
gaagacctgg 
acctggcacc 
tagcaggtga 
ttgaggactc 
gcatttggac 
ccgagaaccc 
gaagaaacat 
ctccccattt 
agtgatggga 
aagtccatga 
agccttgatg 
tgacccctgg 
tggttcccca 
gaattactaa 
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81121 
81181 
81241 
81301 
81361 
81421 
81481 
81541 
81601 
81661 
81721 
81781 
81841 
81901 
81961 
82021 
82081 
82141 
82201 
82261 
82321 
82381 
82441 
82501 
82561 
82621 
82681 
82741 
82801 
82861 
82921 
82981 
83041 
83101 
83161 
83221 
83281 
83341 
83401 
83461 
83521 
83581 
83641 
83701 
83761 
83821 
83881 
83941 
84001 
84061 
84121 
84181 
84241 
84301 
84361 
84421 
84481 
84541 
84601 
84661 
84721 
84781 
84841 
84901 



tagcattgga 
tgcagggaca 
tcctgggtca 
ggccccagac 
ccaaagaaag 
gccagcaagt 
ctaaatactt 
ggaaagcaga 
gaccctgccc 
acttggagga 
ggctgcctgg 
caccggggtg 
acRaatctct 
ttcttctgga 
aaaggataag 
cggtgttaat 
tatagactga 
tatttggaga 
cctactccta 
tctctctctc 
acacaccact 
gcagagccct 
gaactgtgag 
atcctgagaa 
tttcagctga 
aatattccag 
tcttcaaaat 
gcagcatttt 
tgagtattct 
atctagatgt 
tggtttatca 
atgtttcaac 
acaaagcatc 
gtttgcaggg 
tatgtatgca 
tcattctcat 
cagcaagcag 
gactccatat 
gggttttaaa 
Ygggcagtgg 
gtggcccctc 
ctggcacctc 
gagagcttct 
WgctctcRgc 
cagtttgctg 
aaggggagga 
gctgggcgag 
ctcagggttg 
atggtgaata 
aaaaccaacc 
gaactggggt 
agtcatgagc 
aaagaacaag 
cagctgtagg 
aatctggcta 
gactcaatca 
acaaaactgc 
cttactctgc 
ccgactggct 
aataaaagct 
ctggacaagg 
tcaacatgac 
ctaatcaaat 
agtttttcat 



cagacggcag 
gagccaggat 
gcagaaggat 
agcatYtccR 
tcctgctggg 
tttgtaaaag 
tacatgcatt 
aagtagtcaa 
ttgtgggctt 
aatattcttc 
aggaggYgtg 
ccccaaggag 
tccaggcttc 
agtattcaat 
attggaagaa 
cctcctgaac 
attgtgtgcc 
ttgagccttt 
taggactggt 
tctctctctc 
gtgaaggcca 
ccccagacat 
aaaatacgtc 
gagtgattca 
gaatcaccag 
tgatcgcttc 
ctgaattttg 
aaaaagaaaa 
tttgtaagat 
gtttaggatg 
ctgaccactc 
caggaaacca 
ttagtacaca 
atttgggaat 
ctgggacatg 
ttcacagaag 
gaagctcagg 
atttaaaaaa 
cattttattt 
agggggttct 
cgtgttgcct 
tgaacccagc 
tccttggccS 
ttctaatctt 
tccagggcgg 
agccctcctg 
ggattgtctt 
gatggaggga 
ttaaagcaaa 
agaccttcag 
catatcagat 
agccagctga 
aactccaggc 
gacaggggga 
agaccagtaa 
taccctcctt 
gtggcccccc 
cccttaactt 
cttctgagtc 
aatttgcttt 
ctaaggtccg 
caggaatttt 
aatgtcctaa 
cataacacaa 



gtcccagtct 
ccaccccagc 
tctgctgtgt 
ggaacacaca 
cttccatctc 
gtaacgctaa 
atcttatttt 
gtagcttgca 
cttctcccct 
ctgcactgct 
cctgccagag 
gtaaccccat 
atcgctagtc 
gacttcttca 
agtttgctac 
cagctgtcac 
cYccaaattc 
aaggaagtca 
gtccaattaa 
tctctcacac 
tgtgaggaca 
caaccctgct 
agttgttgaa 
ccagggaagt 
aaagtgagct 
ctcctgccat 
acccatcagt 
gagcattgaa 
acttgaaata 
gcagcatttt 
ccagcaggga 
atgaatattc 
gcaggattac 
tatggaactc 
atgtataatt 
gagaaattga 
atttgactca 
tgcctgtggt 
gcacagagga 
ttaggtgggc 
ctgtcccctg 
tggccccgtt 
cccagcccag 
ccctgccctc 
cYcctgagca 
aggaaatggg 
gttttttctt 
acctccagca 
attaaaataa 
aacagctgta 
gggcgcttct 
cagagggcca 
agcaggttca 
ggggatgggc 
gatctaactc 
gtcccacacc 
agttccaaga 
agtaaaccca 
cgcctgcact 
ggctttgcct 
gtctcacctg 
ccaacaaggt 
ttgcacttcc 
aacctcctcc 



ccaagacata 
tctgtccaac 
ttgtgcagtt 
ttgatcaggt 
taagaaaaag 
ctgagcacct 
tctcacaata 
aagctcccat 
cagcctgagg 
gagagggaca 
aaaactagcc 
ggaggttacc 
agcaggattt 
gtgaagcgtt 
cacataaatg 
atggggtatt 
aaatactgag 
ttaatttaaa 
aagaaaaaaa 
acacacacac 
cagccctgag 
ggcaccttca 
gccacccagc 
gccacgatgc 
ttccaccatg 
gtgcctattg 
gggttatgac 
aatatcagag 
taYctataga 
caatcttcta 
ctgggcccca 
accttaagtg 
tgcccagatg 
cagagaaggg 
gtttcctgct 
gacccagaca 
aggccagcgt 
ttctgcattc 
ttcctcctgc 
aggggagtgg 
caggctggtc 
agtaaaccac 
cccttcctaa 
ccctcatctt 
gagctgactt 
gcttagggaa 
tttccttctt 
ctaaatttgt 
aaaccagacc 
accccccatg 
gtcagacaca 
tgctgagatg 
cataatggac 
ccagaggacc 
agcactgggt 
cacctgtgcc 
aatccccgcc 
caaagtcaga 
cccacccctg 
caaattcttt 
cattcagaga 
aaagcaaaac 
atattcaccc 
agtttggttt 



aaccaacagc 
tccattccag 
cctcagaagt 
caagattttt 
acttttggYc 
actgtgtgtc 
aagctcaatg 
caccagtaga 
tcactcactg 
tcctcaagcc 
cggggagatc 
tgggcaattc 
tcagatgcac 
tcttttctag 
gcattgagta 
tttgatggag 
gctctaaacc 
gggagtcata 
agagacagat 
acacacacac 
gcagcctctg 
ccttggactt 
ttgtggcatt 
tttgtggagg 
ttttctcccc 
tcaaacactt 
aatagttcat 
tatattgaac 
cccttatgta 
gaagggacag 
ccttgtaatc 
gcatacataa 
agttgcccca 
gctgggaacc 
gtgctcatac 
gcgagagaca 
tcccttgcca 
tcccctgtag 
tggtcatggg 
gaccctcagc 
tcagggtaca 
agaYcatctt 
ggattctggg 
ggacaggact 
ggttgccatg 
atgaacgcag 
caccaaaagg 
gttttattga 
tgaaacattc 
aaactgcaag 
aacacagctt 
tggatactgg 
acagaaggag 
ctgacaaaca 
ctgacccacc 
acagaagttc 
tcttcccata 
aaccccaaac 
agcctgcgct 
ctctcttcgt 
cctccctaaa 
aggcagtttt 
gataaacact 
tctctattta 



cccagagctg 
cctctctgtg 
tcaggaaaag 
ctgacttttc 
ctggtctctt 
agtcattatg 
gatgaagaaa 
tggcagaact 
ccagcgaagc 
cagcagaggg 
tgggtggcat 
agccacacgc 
tgggctaact 
ttggaaacaa 
taaggtggtt 
gcaccagtgc 
ccagtgaccg 
agagtaggga 
gatctctctc 
acacacacac 
taagccagga 
cttgcctcta 
ttactaaggc 
aaccaatcta 
atgtacggga 
tacaccagtg 
tgtgtcaaga 
atagcaagag 
aattcctggg 
agggtgcata 
aaaggtacaa 
tattatcact 
aaaatgtgtg 
agtttgtatt 
gtgcgaggcc 
ccacccaacc 
ttactttgaa 
tatacctgtg 
ctctctcaga 
actgatcctt 
cggctcccca 
gtttcagagg 
attggctgag 
cagcacagcc 
gcaactctgt 
tgctgacttg 
aggaaggaaa 
tggggtgggg 
ccgggcaaac 
ctatgaatct 
aacctcggcc 
gaggtcacag 
ctgctgtaac 
ggaagtggcc 
tttcacccca 
catgcttgca 
aatccttatg 
cccactgggc 
tctgctttgc 
tgtcagaatc 
cccaccggca 
atcactggaa 
tgtcttccac 
tgaactcctt 
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84961 
85021 
85081 
85141 
85201 
85261 
85321 
85381 
85441 
85501 
85561 
85621 
85681 
85741 
85801 
85861 
85921 
85981 
86041 
86101 
86161 
86221 
86281 
86341 
86401 
86461 
86521 
86581 
86641 
86701 
86761 
86821 
86881 
86941 
87001 
87061 
87121 
87181 
87241 
87301 
87361 
87421 
87481 
87541 
87601 
87661 
87721 
87781 
87841 
87901 
87961 
88021 
88081 
88141 
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88261 
88321 
88381 
88441 
88501 
88561 
88621 
88681 
88741 



cttattcaga 
attggactca 
cacagctcgg 
ggaaccaccc 
gtaatgtttg 
taaggagctc 
tgggaagctg 
agagctggag 
gagggaggtg 
ggccaaccct 
taagtcattt 
aggagaggtc 
ctcttggggc 
cagccttgaa 
ggcgagccta 
ccagcctgga 
tgagccacct 
tggcagcaaa 
atctgctatt 
ggaaaagtcc 
tctgatcatt 
taacaatgac 
ctgtgatgtg 
ggagtctcgc 
ccgcctccca 
tgcccgtcgc 
ttagccagga 
gctgggatta 
tgtctcattc 
agaggaatcc 
ctgcttcaac 
tcatgtatac 
gctcaacttt 
gcttctctaa 
ctgctgtggc 
tgatggacct 
ttcacgtaaa 
actaggtcag 
aattgtacca 
ctggctctca 
aactaaatat 
tgagatgtga 
gaagttaaga 
agctacaaag 
gagRagagat 
aataacaata 
cttttctcac 
tgtttgcaga 
ctccagctct 
ctgcctttgt 
agggtaaaag 
ctgggcgctt 
tcctgctttg 
aKKagaacNg 
ggactcaccc 
ggcctggctc 
tagcctcaac 
ggaagtgatt 
caattgtgca 
ccagatgcag 
cctccttggg 
tcatttcatt 
tcctagctca 
gagtagctgg 



taaatgttgt 
gcccagataa 
gaagggggtg 
cttagggagg 
atcctcatga 
caggtcacac 
ggtcactcta 
ggaggggacc 
agaggtcagg 
caaaagccaa 
gctaggtgtc 
aggatgcaga 
ttggggcagc 
gacaccaccc 
ccagcagcca 
tggttttgct 
atgggcaaag 
tgccaagacc 
tacagacact 
gcttgtcctg 
acagaaacta 
cggaaacatg 
cagggcttct 
tctgtcgtcc 
ggttcacgcc 
cacattcggc 
tggtctcgat 
caggcgtgag 
aatgatcaca 
acaaaagtta 
ctcatgtctt 
acRctcatgg 
tcgtcttggt 
gatcctttgt 
tgccgagccc 
tagggtgttt 
actgcccaca 
agggtatcaa 
attcacactc 
gtagtctctt 
cttgttatca 
agcactgaac 
aatactgggt 
ttgctaatgg 
agagggaggg 
ccaYttcagg 
cgtgtgtctc 
ctttatttta 
catactaaag 
ctcagggttc 
ggggacccag 
cttgctgtaa 
ttcactgtgt 
ggaggaccag 
gacgcttgtg 
agacgctgat 
taggacacaR 
ttggttccta 
gcccagacag 
aSgacaaaga 
caggaaaaga 
tttaataaac 
ttgctgccta 
gactacaggt 



tgtgcctcag 
agaggtgctt 
gcttccactt 
ccgtactgtg 
caacatgggc 
agaggggaca 
cccacagggc 
ggcctctgct 
gctgctggtc 
ccctgctgtt 
agggtaggga 
tgccccccag 
ctcatcRgcc 
tccattctag 
ggtctctgaa 
gcacccttga 
ggaggcttga 
aacctaaccc 
caactcattc 
ttggagaatg 
cacagtctat 
tgtgacaact 
ttttttattt 
aggtggagtg 
attctcttgc 
taattYtttg 
ctcctgacct 
cggtcgcgcc 
cacagcccgg 
gggaacttgc 
caatgactgt 
taaatgactc 
gaattcactg 
cccacaaaga 
tcctggcgct 
ctaattcttc 
tttgatttta 
catctcaaaa 
aaactagcag 
tatcaggaca 
ataccaatta 
aaaggagaag 
acaactgagg 
Ragacataaa 
tacaaataag 
cccatcccag 
tgcgcactcc 
acagctcagt 
ccccctcctc 
agacacacag 
tcctgggcag 
gggacagggg 
gaccccctag 
gagagagcag 
atgagctgca 
ctggggcctc 
tgcKtcccct 
gaaaaaccca 
ggagccctcc 
Ytttcagcaa 
ggggttgaga 
cgttttgctS 
gatctgctgg 
gcacgctacc 



atgtttcttt 
gtgacaggca 
acaatcaaca 
tggcaggcac 
aaatatccta 
tggccaagac 
cactcagagc 
ggaaacaccc 
aacctcctct 
tccatttccc 
ggaagtgggc 
aaccctgggt 
acctccRtct 
ctgtgggtag 
aggcttacct 
ggaagagaaa 
acagctgggc 
agattctttc 
cagctccctc 
aggagaagat 
acgcagaaat 
aattgatgat 
ttatttttat 
cagtggcgcg 
ctcagcctct 
tattttcagt 
catgatcagc 
gggcctgtga 
gagatgggta 
cagtgttttc 
gctctactcc 
ctcattgcaa 
gtgctgcagt 
cctggatcct 
gccctgttcc 
atgtaaatta 
tttccttaag 
tgttttatac 
tatatgaaga 
aaaaatatat 
gtacaaactt 
gaggattaat 
gtataagaaa 
actggtgata 
atgaaaatga 
ttaggggcgg 
ctggcacacc 
ggttcccggg 
aaaagccact 
atgtctgtct 
cNttcaccag 
tgacaggaga 
gccaRggSaR 
aggagcagga 
cccaggatcc 
tgctgaatgt 
ctaggcgagt 
catggctcct 
ctcccacctc 
agcagctccc 
cagtgtgctc 
tgtcacccag 
gctcaagtga 
atgctgggct 



gacaaggggt 
gtgcagggtg 
ggggtcaggt 
tggctggccc 
gtggtgcagg 
cagaatgggg 
agaggggctg 
cagggtcagc 
cccacccctc 
caaggctgtc 
acctgggcca 
caggggactc 
gggttccgtt 
gaaccagcca 
tggaactgtc 
acaaattgtg 
ctctgcagat 
tctatcacca 
ttccctcact 
aatcagagga 
agagatggga 
agccagtctg 
ttatttatat 
atctcggctc 
cgagtagctg 
agagaagggg 
ccgtctcggc 
tgtgcagggc 
ccactttccc 
tattgctaga 
cttaaatata 
atccatgtct 
ggacagcctg 
gccagatgcc 
ctctgctgtc 
tgctacaaag 
atatttgtag 
ctagagccac 
tgcccattct 
acacaacatg 
taaggcacga 
attgtcattt 
gttgtaagta 
gaatgttagg 
ctaataatga 
ctccaagcat 
cacttccagc 
ccccctgatg 
cgcctccctc 
tctggggccc 
gggagtatgc 
ggggcatccc 
ccatctcaac 
gggcagagag 
catcctcctt 
tgaggctgga 
ctaccagttg 
ggcccagccc 
agggctctga 
tccatgtcgc 
ctcctgagaa 
gctgcagggc 
tcttcctcct 
aatttgttaa 



actttcgatg 
ggtgccctaa 
agaggggatg 
aggtacccat 
tgaggaggag 
tccagttcag 
gtgcagggca 
caggagcaga 
cgctactcct 
ctgccccact 
tgacccgctg 
cacgtggcct 
ggggctcact 
gggaagagga 
cagccactgt 
ggattgaatg 
ccctctgagg 
agtccccatc 
ctcacaccaa 
ggggcagcca 
aggaacgttc 
tattgagtgt 
tttttgagac 
actgcaagcg 
ggactacagg 
tttcaccgtg 
ctcccaaagt 
ttctgacatg 
atttgaacac 
gccaggactt 
caatattgag 
cacactccgt 
tgtccccctt 
ggcctagccc 
ctcccctagc 
aacaYcctcc 
gagtagaatg 
actRcatgca 
ttcagtctta 
taaatttgag 
aggtagcagt 
tttaaagaKg 
aattgtttac 
aaaaggtgga 
tactataaat 
taccattgtt 
agctggtctt 
gggttccatc 
tttccctttc 
ccatgcaRca 
agaggggcRt 
ttcSccttSt 
accaNgagcc 
ctggggcctc 
ggtcattgtt 
tactgactgt 
gtcaattccg 
aagccctgcc 
gccaagctca 
tgcggggcct 
atcaccattt 
tttgacgcca 
tcaaccttct 
tttttgattt 
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88801 
88861 
88921 
88981 
89041 
89101 
89161 
89221 
89281 
89341 
89401 
89461 
89521 
89581 
89641 
89701 
89761 
89821 
89881 
89941 
90001 
90061 
90121 
90181 
90241 
90301 
90361 
90421 
90481 
90541 
90601 
90661 
90721 
90781 
90841 
90901 
90961 
91021 
91081 
91141 
91201 
91261 
91321 
91381 
91441 
91501 
91561 
91621 
91681 
91741 
91801 
91861 
91921 
91981 
92041 
92101 
92161 
92221 
92281 
92341 
92401 
92461 
92521 
92581 



catagagaca 
ttcccgcctc 
ttgatgggca 
aatcccagca 
cctggctaac 
ggcgggcgcc 
ggaggcggag 
cgagactccR 
tcagatttca 
ctgagagggg 
tggagagcag 
tgattttagt 
agctctcact 
caaagctggc 
cagccatgtt 
ttgctcctag 
gggtccaggg 
ttcagtttag 
tgggatggtc 
gtgcatttgt 
tcagtaattt 
caagatatta 
caggacgtga 
ccctaaagat 
atatggacag 
gtgctgttgg 
attttgtatt 
gggtctccct 
agaggtctga 
gctcagcctc 
ccacagggcc 
ccttgtatct 
agcctggttg 
ccttctttga 
ctggggtcac 
gaacccatct 
ggagcaatca 
tctgaacgag 
atagagctat 
ggatcttcct 
ctccagctgt 
gaattcgaaa 
acacgctgct 
aggagcagcc 
tcggggttgg 
gacccttcag 
tcctgcccat 
tatatttatt 
ctgtagccaa 
ggctcaagcc 
ccacccagct 
ctactcttga 
gcctgctctt 
ctgccgggat 
tcaggcattg 
gtgttccact 
ctatggctaa 
gaagggcctc 
ttcatcctga 
cacaccagct 
tgcaatacat 
aatgggccag 
gttgctttca 
acacgtctca 



gcatctcctg 
tgcctcccac 
tctctttttg 
ctttgggagg 
atggtgaaac 
tgtagtccca 
cttgcagtga 
tctcaRaaaa 
cagaaattac 
aaaatcttag 
acgggaagcc 
aatccaacag 
cctgtagtta 
gtcatgccac 
tcctattgcg 
gcggatcggc 
gggtcaccgc 
tgagggcggg 
acatggggat 
atgtagaagt 
ctaacagagc 
atcagcagta 
agctagataa 
gcctagaaga 
gcgcccccct 
ttcgttctgg 
tacaataatc 
acagtttccc 
acagagggac 
ctgacactgg 
tggcacaagt 
caaaagaaat 
tggccagacc 
ttccttcaaa 
ctggccatgt 
gagctatttc 
gactcaagcc 
ttacctaacc 
tgcgagaata 
ctgacagaga 
gcacctatat 
ggtaaaatga 
ccccagcacc 
tgagaccaga 
gggtctgcta 
attactcccc 
accatcagca 
gtaaggccat 
ggctgaagtg 
atcctcccac 
aatttttaaa 
actccaggcc 
gttgaaggcc 
cccctgcttc 
cctcctgtca 
cccgaggcac 
gaagtcatgc 
aaagacagct 
gcaaacatcg 
tttgagacag 
tccatttgca 
gattatgatc 
tccagcaact 
gtcagtttag 



tgtttcYcag 
agcactggga 
gtgaagaaaa 
ccgaggcagg 
cccatctcta 
gctactcggg 
gcagagattg 
aagaaaaaaa 
ccccctgatt 
gctgggaggg 
aacatctgcc 
tgtccaggtt 
cactcacaat 
aggcaccatt 
ggatctggcc 
aggttgagaa 
cttctggtca 
ggtaggttag 
gaagtaattc 
acagtataca 
cttaaaacag 
acaattgcaa 
ccggttagac 
gccacggcaa 
gcatccattc 
cagtccaacc 
aggagcattt 
attagggaca 
agaggtgagg 
accctgggca 
aggtgctcag 
ctcttgcctc 
ataaggacag 
tttcccaaag 
ctcagcagtc 
cccacctctc 
tggRtgcaaa 
tctccgagct 
aggagatggR 
ctgagcaaga 
aataaccaga 
aagtcacaac 
tcctgcagtc 
cctccaggtc 
tagtcttcag 
gtcctcccgt 
tttgcacagt 
tggggttatt 
caccgccaca 
ctgagcctcc 
aaagttatat 
tcaagtgatc 
atggcttgtg 
caccacctgg 
gagatccccc 
acagggtgga 
aaccagattg 
gaacctcaga 
ctgggggcat 
gaacagaaag 
aatgcttctc 
taagttttca 
cagggaggag 
aggtttactc 



gctgtcttga 
ttacaggctt 
atcaggccgg 
cagatcacaa 
ctaaaaatac 
aggctgaggc 
cgccactgca 
aaaaaagaaa 
tcatggactt 
gaggtgactt 
cttgtgactc 
tgattctgct 
tccacaggtg 
ctgcaatgcg 
agcagccccc 
ataatagaca 
acaatgcact 
tgagggattt 
tttaacataa 
gagataagaa 
aaacacaatc 
caaaagctgg 
tagaaattct 
gatgatagca 
ataggctctc 
tggcattgtc 
catcttttat 
cagcacagcc 
gccacagaca 
agtcactccc 
actcagggtg 
atcaggcctc 
accRggctcc 
agatgggcac 
taaaacagag 
tctacggttt 
tcccggctct 
tatctacaaa 
gggaggtcac 
tccagctggt 
cacgtcctcc 
ttcccagcag 
cagccccacc 
tctttcatcc 
gaagaaagac 
gcttccacat 
ctcctacaca 
ttggttttgt 
gtcatagtgc 
tgagtagctg 
tagagatggt 
ctcccacctt 
gcttctagat 
aaaaatcatg 
agcatcctcc 
gggactagag 
ctattggact 
agccaacctg 
cgtggccacc 
gtgattacac 
catttgattg 
aatgaggaaa 
ccaagacttg 
tgccaaggtt 



actcctggcc 
cagctatagt 
gagcggtggc 
ggtcaggaga 
aaaaattagc 
aggagaatgg 
ctccagcctg 
aagaaaaata 
ttggcagctc 
gcccagagtc 
tgctattgaa 
tctacaaaca 
gcttttgaat 
accttctcac 
agtgcaacgg 
cacacaagat 
ggatatacca 
agggtcattt 
cattcgtatg 
tttacaatat 
tttccataac 
ttactgacaa 
cagaagggag 
tttatagccc 
tgcaggggga 
tttacacaat 
tccgtagcaa 
tgtttcttct 
tggagtccta 
ctcccagggc 
ggtgagatga 
agcagccgca 
aggtcacctc 
ccccaacaca 
ctgagcccag 
aagggcccag 
accactgctt 
agctgaatga 
accatcccca 
ctgagctgtg 
agcccccaag 
ctcacaatca 
ctccttgctg 
aacccacctg 
ctgccactga 
tcccctgatg 
tgccaagtgg 
tctggagaca 
actgcagttt 
tttctacagg 
gtctcactat 
ggcatgagcc 
atttgcttat 
ctcatYgttc 
caagcagagg 
accactgctc 
tttgtgcttt 
acccttccat 
tccccccacc 
agggactgca 
aattttataa 
ctgaggcaag 
tcaaccaaag 
gaggatgcac 



tcaagtgatc 
gctttctatc 
tcacgcctgt 
tggagaccat 
cgagtgtagt 
tgtgaacccg 
ggcaacagag 
agaccacgac 
tatcaggttt 
gcacagctct 
gaaaaacggg 
cagagatcta 
ctcatgatta 
tcccctgtct 
ggctctctct 
agtgaaagct 
gcatttatta 
gattacgagg 
tagaagtaca 
agtgtgtgcg 
ctatgattag 
tccatggaaa 
tatacctgaa 
tagcttgtcc 
agcacatcac 
cctgcatgca 
tagtttcagg 
ccatcacccc 
tgatctacct 
ctcagtttcc 
cctctaggtt 
ccccggggcc 
cccacctcca 
aaocttcctt 
agagctttgt 
caggagggag 
tcctgtctga 
tccttccctc 
acttaccaag 
tggatctcac 
atatacccag 
agcacagcaa 
ctgcgcttag 
cctggcatcc 
catactgtgg 
cctgccttac 
atccttattg 
gggtctcact 
caaacccctg 
catcagccaa 
attgactggg 
accgcaccag 
gtggggtctt 
agtgacaaaa 
ccactgcccc 
agccacactt 
gccccaggca 
gggggcctat 
catacgcaca 
caatacactt 
tcatcccctc 
gggcgaggaa 
agggtctggg 
ccaggaaaaa 
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92641 
92701 
92761 
92821 
92881 
92941 
93001 
93061 
93121 
93181 
93241 
93301 
93361 
93421 
93481 
93541 
93601 
93661 
93721 
93781 
93841 
93901 
93961 
94021 
94081 
94141 
94201 
94261 
94321 
94381 
94441 
94501 
94561 
94621 
94681 
94741 
94801 
94861 
94921 
94981 
95041 
95101 
95161 
95221 
95281 
95341 
95401 
95461 
95521 
95581 
95641 
95701 
95761 
95821 
95881 
95941 
96001 
96061 
96121 
96181 
96241 
96301 
96361 
96421 



gacacgcaag 
ttcaacattt 
ataggtaatg 
ataggttcca 
gttaatctac 
gcatttgtct 
caagataagt 
aaaatttggg 
gtatatgttg 
tggagtgact 
agatttgatt 
gtagaagcaa 
caccttccgt 
agtcagaaaa 
aagaattcaa 
acaaagctag 
ctagtactct 
ttttagtctt 
gcataaggag 
ctcagagtag 
cctgtatgaa 
ggcctgtcag 
tataggcaaa 
catgtaatta 
tttttttttt 
cttgctggat 
cacaacaatg 
ctttaggtta 
tgttttgaga 
tcactgcaac 
tgggactaca 
ggtttcacca 
gcctcccaaa 
ttttaaaatc 
ctggtttgct 
gaaaacattt 
aaatgagtta 
tttgtttcat 
caagtattct 
acaaaactcc 
gccacaggga 
aacatgctcc 
acatcataga 
ggatatttgg 
ttttcttgct 
taaattcagg 
caatatttct 
tctccatttg 
ggatgagaag 
tcaggctagt 
gggaaacttg 
tccaatgacc 
tagccaggtt 
aagatttctt 
ccataaatgt 
ctgggctgct 
atcagcaagt 
tacaaagatc 
atcacatgct 
ggtgcacRta 
aaccggtctg 
gtactgcagg 
ccagagcggc 
ttaatattcc 



tcacaggaag 
aaatgtgtaa 
agctgagtgg 
tgtgaaaggg 
attttctata 
gagYgtgggc 
tgccagtttg 
gacctacaag 
ccttccatct 
ctgtttccag 
ttcctttcac 
aagaatctcc 
ttgcNtgcag 
atgagagaaa 
tatctagtgc 
aatctcataa 
atttctacca 
ctaatacttg 
tcttttttaa 
gccYtttaaa 
ttgggtgagt 
aaagtgatat 
ggacttaata 
ccttcttcca 
tttccagttt 
acaaagtttc 
tgaatgtaat 
ggaatatttt 
tggagtcttg 
ctccacctcc 
ggtgcatgcc 
tgttagccag 
gtgctgagat 
cttgaaaacc 
aaacattttt 
tcttcctagc 
ctcagaaaga 
tttctggacc 
gaaaatStaa 
aaataccatc 
gctgggtatg 
tgaaacagac 
gtgtgcccca 
catcagacag 
tgtagaataa 
cagggtgggg 
gagcactcac 
taactgaaga 
cagtgaaagg 
gccttagaga 
atatgaaagg 
atcttgagtt 
gaaaattcaa 
ctactggggg 
tggccagctg 

gggggtgaca 

ttttattcag 
acatgcttca 
tctgagggaa 
ttgtcttgat 
accacaaatt 
agaccagggc 
cgtttataga 
ttgctgggaa 



atctggggcc 
aatcaggcag 
tcacattttt 
ggttagagga 
agataaaaat 
agaagggtca 
cattgtcaca 
gattttcttt 
ttgcaaatat 
gcttaactgg 
attttctctt 
agtcacgggg 
gggtttagaa 
aacaaaagga 
ggataaattg 
cagacgtttt 
aagacaaatc 
gcctgattat 
aatttgactt 
accttgagct 
tcctttcttc 
tctttattta 
aaggactgaa 
tttgatattg 
tgctgtcagg 
cattttgcaa 
tccactgaac 
accacaattt 
ctctgtcacc 
caggttcaag 
accacgctca 
gatggtttca 
gacaggagtg 
cacctttgca 
acacacataa 
ctcgtttccc 
ggcagagctg 
atcctgtttc 
ttttcttcct 
agaatgtgtt 
gctgggcYgg 
cccatgtcca 
atgtcctgtc 
ctggaaatca 
agctaatcat 
aatatgcatg 
tctgtataca 
aaagttcagc 
aaagtgactt 
aaccatttca 
catgcaagag 
attgccccat 
tgtagccttg 
aacacactcc 
agaaataaag 
tcacatatca 
ggttttaaaa 
aagggcaaaa 
caggacaaag 
aaacatctta 
taccagggtg 
atatctcagt 
cctcccccca 
aagaatttgg 



cactcattat 
gcgtggaagg 
gtgaggcttt 
acagtccatt 
aaacagaaag 
cttccactct 
gtgaaattat 
gtgaggaatt 
ctgtttagaa 
ttccactggt 
tgtttttcaa 
ttggtctgaa 
aaaggacagt 
atatgttagt 
tagacaaata 
atagtttctt 
atagtaggac 
ttgcataaag 
tgctaggcca 
cagccacgaa 
tcgaggcccc 
ccacagttca 
attaatttca 
gcttagcaac 
gcccagggga 
gaagaaaagt 
tgtacaccta 
aatttttgtt 
caggctggag 
caattctcct 
gctaattttt 
atctcctgat 
agccactgca 
gaaaaaaaaa 
tcatacaagt 
agaagagaaa 
ggcttgtagc 
ccctaaagga 
cactttctcc 
gcccaagtcc 
ttatgaccaa 
gaaatttgca 
cagatttctg 
ctttccccag 
agaatttacc 
gcatatagta 
gggcagtgag 
tgctcactgc 
cattccaaaa 
gattttgaac 
tggtgaagaa 
cgggtgtatg 
aggcaattct 
caacatttca 
agagacagta 
gcaggactgt 
aggggagggt 
agcagaacta 
ggcaaaagca 
aacaacagaa 
gactttttcc 
ccttatctca 
gaaatgcatt 
caatatcttc 



tcccagaggg 
agggaaggaa 
gatcggccct 
attcattctt 
tagaggaagt 
tgtcattgtt 
acaaagtctg 
tgtgaggcag 
acaaaaacaa 
atagtgaatt 
tatcttttgg 
ttctgtaaca 
tttgatttct 
ttggagactt 
ataaagctgg 
ttaaaacaca 
gcatttattt 
tgcagaagaa 
ttttcgtatt 
tttatgtttg 
atgatcactt 
ggaaccctgt 
gtcctttagt 
cttcctgaac 
gggagaataa 
tctgtggatg 
aaaatggtta 
tgtttgtttg 
tgcagtggcg 
gcctcatctt 
tgtattttca 
gttgtgatcc 
ccaggcccaa 
taatagcacc 
ttcaacaaca 
acaaaggttc 
tggacctttc 
aatgatttcc 
ctgttggctc 
cacctgttct 
ctgtaacccc 
aaacctaccc 
tccctcccat 
tttttggctt 
tctcagaatt 
gatgctcatt 
caatcatgat 
ttgcaaaggc 
gcccatatct 
tgagggcagg 
gtgccagtct 
gactggggac 
gtagcttcct 
atggaggttc 
caaagagagg 
gatgcccacc 
gtgtaagaac 
ctaataaggg 
gaaatactga 
aacagggttt 
ccaccctaat 
accgcataag 
cctttcccag 
cctacttgcc 



ttttgagggc 
aaaagacagg 
ccctgaatcc 
cttgtgcaca 
aggcatatgt 
aggtaccagt 
tctcagggtg 
gccaactggg 
aggcaggttc 
tggggccctg 
agaaaacagt 
caaagtattc 
ggtgatgcca 
gtagccagga 
aaaacaatgg 
ttttctctct 
gcaaaataag 
taatcttgtg 
cataagctat 
cctgcagata 
gggcttcctg 
aaaggaactc 
aattcagtcc 
acatcagctt 
ggagttatgg 
atagcggttg 
aaatgatcaa 
tttgcttgtt 
caatctcggc 
ccccagcagc 
gtagagacag 
acctgccttg 
ttaaaaaagt 
tgagcctttt 
gccctgtgat 
aaatggggtg 
tggttcttgt 
tgaaatttct 
aagatgaatc 
atggctgatg 
atcaaggatt 
tgaaatgttg 
gagctggaga 
catgactcag 
cctaagagaa 
aaacaagaat 
gtctgcatta 
aaatgaacaa 
ggggatgagg 
tgtttcaaaa 
acatgacttg 
atctccagga 
ggatggtttc 
tttctatttt 
aattttacag 
tgaatctcag 
agggagtagg 
tctaacaaag 
tatgttcagc 
gagggtagag 
aagcctgagg 
acagacatcc 
ggtattaata 
catctgttta 
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96481 
96541 
96601 
96661 
96721 
96781 
96841 
96901 
96961 
97021 
97081 
97141 
97201 
97261 
97321 
97381 
97441 
97501 
97561 
97621 
97681 
97741 
97801 
97861 
97921 
97981 
98041 
98101 
98161 
98221 
98281 
98341 
98401 
98461 
98521 
98581 
98641 
98701 
98761 
98821 
98881 
98941 
99001 



taggctctct 
cttatggttg 
actgatttca 
tatcactgtg 
attaaagtaa 
catgaaatct 
ctccctgact 
aaatacagtg 
ctaaaagcaa 
aaatgggata 
tctcctatcc 
ccacacaact 
gctggggtgc 
caagcagctg 
tagagatgag 
cttctgcttc 
actttaacat 
aaattccttt 
tgatgtgggt 
caggtgcaaa 
taaagtccaa 
aacaagacac 
cctttctaaa 
atcattcccc 
ctgtgttttt 
agggagccaa 
gcttaaatta 
acagcacagc 
ccagtttccc 
gaaactcagt 
acaaggtcct 
gatggtcaca 
aacctagggg 
ctcccaatta 
accaacgttg 
ttgccactca 
agcccaaaga 
caagttgcag 
tggagttgcc 
aaagacgaaa 
aattttcatc 
gcagctcgtg 
gtcccaccct 



gcaagaagaa 
tcttcccttg 
tattgttcaa 
gtcctgaggt 
agacaggcat 
tcacaattta 
tcctgcaaca 
catgggaggg 
aacaagaaat 
cttggttaca 
cYatgtacag 
ggccttggct 
agcagcagat 
aaactacagg 
gtctcatgat 
agcctcccaa 
tatttttctt 
cccctccctt 
gcatagaaat 
attctccggc 
atgatgccaa 
agacctgcac 
cccctctctc 
cactgtgagc 
ggccactgag 
tgcacccagc 
tagaaatttt 
tggaaaaata 
ccaacggtcc 
ctcagctgcc 
aggagaagtc 
aaggcagaat 
tgggccttga 
ccgatatatt 
ctaggcagtt 
gagggagttg 
acagaagagg 
acatcactgc 
agggatactg 
aaaaagagag 
ttttgccacc 
atcacgcaaa 
ccttgctgct 



aagtatagct 
ttccctaaaa 
ccacacatgt 
gacgtacatc 
aagaaattat 
tgttcctctg 
tcttccctgg 
gagttcctag 
ggacaaagag 
ttttcatctt 
accagaaaca 
tccttttttg 
caaacgtctg 
catgtgccac 
gttgcccagg 
tgtactggag 
tctgtcttaa 
tctcaccaag 
tcaaggagtg 
ttagggggag 
ttggacctcc 
cctcctgcac 
cagcctaaag 
acttgaatga 
tggtgagcag 
atggcttccc 
ttcatttcta 
gttgcaaaac 
tccagttcat 
tgaagtggtc 
aacatcaaag 
tcagaaggag 
gttacttagg 
caatcaatga 
tcagtggcag 
ttgttttcca 
acacaaaaca 
tctcaccttc 
ggcagtcctg 
gaaataaaaa 
tggagaacct 
cccactaaac 
gcacttacag 



ctttttgccc 
tcgctgttat 
tttacaatca 
ctcagcttag 
aaaagtatta 
ctgcggctcc 
aatttctaag 
tagaaaagga 
aggagaccag 
ctccatgcac 
gactcagaaa 
ttagagacag 
ggctcaagtg 
catgtctggc 
ctggtcttaa 
ttgtaatcga 
tctcagaatg 
ctctcttgtg 
aattttgaaa 
ttaccaacaa 
tgacgggcta 
cactcccaca 
tagaaatgat 
agctgctttc 
ctggacttga 
tttttcacct 
agacaaataa 
tagcacagag 
ctgatgtagg 
acttatccat 
ccacctccaa 
gaagagactt 
tcaccgatgc 
tataggagat 
gtggttttgc 
cgtggaagct 
caggaatctt 
cattggccac 
tgcctgtgga 
cgggtaaaat 
gaaatggaaa 
acactgctcc 
gagcaacccc 



gaccctgcag 
tctgttcttt 
atttgtacag 
gaagataaca 
tttgggaact 
agccggtccc 
caatctaaca 
gaacaaagat 
aaacaaacaa 
tctctagtac 
ataaaatcat 
gatctcactc 
atccccccac 
taatttttta 
actcctagcc 
gtaacccagc 
tagctttaaa 
cacagtgctc 
aaaaaacagg 
ttaatccacc 
cttctcaaga 
cacctccaat 
cttttaagag 
ccttcaccgc 
gccagttata 
actgatcctc 
tgatagactc 
aggccctgca 
agctggactg 
gcgctccctt 
gctgtggtca 
agagccaagg 
caagctgcca 
tgctgaactg 
tcccgaaggt 
ggcgcttacc 
ccccttaaga 
aatttaggaa 
aactcagcag 
gctcaatttt 
cgaaaagtca 
ccggcaccac 
agacccagtg 



gcagtcagac 
ttcaaggtgg 
ttcacataat 
ggattaagag 
gataaatgtc 
tccattcggg 
tgattaaata 
tcacacatta 
acaaaaaata 
tcttgatgtt 
ttccccaagg 
tgttgcccag 
tgcaggaacc 
actttcttgg 
tcaagtaacc 
attgaaatgt 
atgtacttca 
acttatctaa 
cagagaaacc 
accaacaggg 
cagccattgg 
tccaattttc 
catgagcctg 
aactcacttc 
gcattacagg 
acgattcagt 
acaagacaaa 
gactctttgc 
ggctggtctg 
agaggggatg 
gcccaaagga 
gtggagatgg 
taacaaagag 
agacattcac 
cgaccagggg 
catacatccc 
gaaacaccta 
gaagacccca 
cYtgatgatc 
ataactttta 
caacttctca 
cttcagcccg 



[0242] The following cDNA sequences 1 -6 show human cDNA structure for transcript variants 
1-6 of APOL3. cDNA sequence 1 encodes the longest isoform (1), which is shown in amino acid 
sequence 1 . cDNA sequence 2 differs in the coding region but maintains the reading frame compared to 
the variant in cDNA sequence 1 . This variant encodes isoform 2, which is shorter and which contains a 
unique N-terminus compared to isoform 1 . cDNA sequence 3 has multiple differences in the coding 
region but maintains the reading frame compared to the variant in cDNA sequence 1 . This variant 
encodes isoform 2, which is shorter and which contains a unique N-terminus compared to isoform 1 . 
cDNA sequence 4 differs in the coding region but maintains the reading frame compared to the variant in 
cDNA sequence 1 . This variant encodes isoform 2 (amino acid sequence 2), which is shorter and which 
contains a unique N-terminus compared to isoform 1 . cDNA sequence 5 differs in the 5' UTR and 
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coding region compared to the variant in cDNA sequence 1 . This results in translation initiation from a 
downstream ATG and an isoform 3 (amino acid sequence 3) with a shorter N-terminus compared to 
isoform 1 . cDNA sequence 6 differs in the 5' UTR and coding region compared to the variant in amino 
acid sequence 1. This results in translation initiation from a downstream ATG and an isoform (3) with a 
shorter N-terminus compared to isoform 1 . 

APOL3 cDNA SEQUENCE 1 (SEP ID NO: 21 

NM_1 45640 Homo sapiens apolipoprotein L, 3 (APOL3), transcript variant alpha/d, mRNA 



l 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 



agcaggaggg 
gctgggaagc 
ctttcccttt 
cagatgcacg 
ctttcaagag 
gggagagagt 
gattcgtgac 
agaagcttag 
ttagggaatg 
aaaagcttcg 
atgtggtgtc 
tggcaccatt 
cagcgtctgc 
cagaagctga 
aagttatgcg 
cacaaaccat 
ctgtgaccac 
gcaccacccg 
tccttgcact 
agtctgcatc 
agctcactca 
cagccagcag 
gggataaaga 
tagctgagca 
gaatgctgga 
agaaatttcc 
aaccagcaat 
gaacttttgg 
gtccttccaa 
cactctgtct 
ggtggaggtg 
cctgcttgaa 
ggtgtctcta 
ctcattactc 
ctgggggagg 
aactcaccta 



tgggaccaag 
atcctgtttt 
tgggttccag 
gctggaggtg 
aagcttcctt 
cagcccagtg 
tgcggctgaa 
aacatatgca 
gtttttgaaa 
tgcccttgca 
cagctccact 
tacagcaggg 
tgtgactggg 
agccagcagg 
tgacatcaca 
tgggagtgaa 
ctggcgaatc 
ggcagtgagc 
ggatgtggtc 
tgctgaggag 
gatctatcag 
gggaggtgag 
gggcaaggta 
cagcagggga 
gcctggaata 
tttagactaa 
gagaaggcca 
atacagcaga 
gaatgaagtc 
tgtcctcctg 
gtggttgtag 
ccctgaaggc 
aaaacccaaa 
atatacagca 
ccggatctct 
ctcaccaa 



ggtgctgctg 
gcatgtttga 
ggtatatctc 
ggatccacac 
gaaaagaaac 
catctgcaaa 
ttgcccaggg 
gctattgagg 
gagtttcccc 
aatggtattg 
ggcgctgcct 
acgagtctgg 
atcaccacca 
ctgactgcaa 
cccaacttac 
atccgtgcca 
tcagctggaa 
agaggagccc 
aaccttgtat 
ctgaggcggc 
cgtctgaatc 
ccatacacag 
aagtttatgg 
ggggttaatg 
agggaggaga 
cgaatatatt 
ggaaaagaaa 
agaaacagcg 
tttccctggt 
gaagtgtatc 
tgtgatggat 
caggtaatga 
catcctggag 
ggcaaagaga 
agagccatcc 



gaccaaggat 
tcaggagctg 
agagcctgga 
agctcagaac 
gctttactga 
tcctgctgac 
atgaggcaga 
acgaatatgt 
aagtcaagag 
aagaggtcca 
ctggcatcat 
cccttactgc 
gcatcgtgga 
ccagcattga 
tttcccttct 
tcaggcaagc 
gtggtggtca 
ggatcctgag 
acgagtcaaa 
aggctcagga 
catgccatac 
gccacgacaa 
agctgagtgt 
cagatggcaa 
ggggactgga 
ggggggagga 
gagctgaaaa 
gctccactac 
gatggtcccc 
tcagtcagcc 
cccctttagg 
gccatggcca 
agtatgcgag 
cagaaaatta 
tgctgagtgc 



gggactgggc 
ctgccaagtt 
gaacgtgtct 
agctggatct 
agaggccacc 
taacaatgaa 
tgctctctac 
gcagcagaaa 
gaagatccag 
cagaggctgc 
gtcccttgct 
agctggggta 
gcactcatac 
ccgattgaag 
taataattat 
cagagccagg 
agcagagaga 
tgcgaccact 
gcacttgcat 
gctggaggag 
ccactgaccc 
aatgcaggca 
tagtgacttt 
gtgcaccaag 
gagtgtgggg 
atagagggga 
tgcagaaagc 
cgacctgccc 
tgccctgtct 
agtggcttct 
ttatttaggg 
ttgtccccag 
aacctaccaa 
actgaaaagc 
cctgtgtgta 



caagggtggg 
gtgactttca 
ggttattatg 
tgctcacact 
aaatacttcc 
gcctggaaga 
gaagctctga 
gatgagcagt 
gagtccatag 
accatctcca 
ggtcttgttt 
gggctgggag 
acatcatcag 
gtatttaagg 
tacgaagcca 
gcccgactcc 
acgattgcag 
tcaggcatct 
gagggggcaa 
aatctaatgg 
cagaccagtg 
ttttattagg 
ggcatttctg 
gagaaggcag 
aataggaaga 
ggtgtgcagg 
cgaagagtta 
ccggttcgat 
ttccagcatc 
tgatgatggc 
gtatatgtcc 
ctgaggacca 
gaaaaacagt 
agtttagaga 
agtcctaata 



APOL3 cDNA SEQUENCE 2 (SEP ID NO: 3) 

NM_0 14349 Homo sapiens apolipoprotein L, 3 (APOL3), transcript variant alpha/a, mRNA 



1 agcaggaggg tgggaccaag ggtgctgctg gaccaaggat gggactgggc caagggtggg 

61 gctgggaagc atcctgtttt gcatgtttga tcaggagctg ctgccaagtt gtgactttca 

121 ctttcccttt tgggttccag ggtatatctc agagcctgga gaacgtgtct ggttattatg 

181 cagatgcacg gctggaggtg ggatccacac agctcagaac agctggatct tgctcacact 
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241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 
2161 
2221 
2281 
2341 
2401 



ctttcaagag 
gggagctgga 
aaacaccggg 
gcccctccaa 
cctcctcaag 
ccatggaaag 
tactgaagag 
gctgactaac 
ggcagatgct 
atatgtgcag 
caagaggaag 
ggtccacaga 
catcatgtcc 
tactgcagct 
cgtggagcac 
cattgaccga 
ccttcttaat 
gcaagccaga 
tggtcaagca 
cctgagtgcg 
gtcaaagcac 
tcaggagctg 
ccatacccac 
cgacaaaatg 
gagtgttagt 
tggcaagtgc 
actggagagt 
ggaggaatag 
tgaaaatgca 
cactaccgac 
gtcccctgcc 
tcagccagtg 
tttaggttat 
tggccattgt 
tgcgagaacc 
aaattaactg 
gagtgccctg 



aagcttcctt 
gcaagtagaa 
ctaggttggt 
cctccctctg 
ggttaagaaa 
tctctcagtg 
gccaccaaat 
aatgaagcct 
ctctacgaag 
cagaaagatg 
atccaggagt 
ggctgcacca 
cttgctggtc 
ggggtagggc 
tcatacacat 
ttgaaggtat 
aattattacg 
gccagggccc 
gagagaacga 
accacttcag 
ttgcatgagg 
gaggagaatc 
tgaccccaga 
caggcatttt 
gactttggca 
accaaggaga 
gtggggaata 
aggggaggtg 
gaaagccgaa 
ctgcccccgg 
ctgtctttcc 
gcttcttgat 
ttaggggtat 
ccccagctga 
taccaagaaa 
aaaagcagtt 
tgtgtaagtc 



ggacaaaagg 
tttctctaaa 
cttggcattt 
tccacaggtg 
aaaaacgaac 
acctggctgc 
acttccggga 
ggaagagatt 
ctctgaagaa 
agcagtttag 
ccatagaaaa 
tctccaatgt 
ttgttttggc 
tgggagcagc 
catcagcaga 
ttaaggaagt 
aagccacaca 
gactccctgt 
ttgcaggcac 
gcatcttcct 
gggcaaagtc 
taatggagct 
ccagtgcagc 
attaggggga 
tttctgtagc 
aggcaggaat 
ggaagaagaa 
tgcaggaacc 
gagttagaac 
ttcgatgtcc 
agcatccact 
gatggcggtg 
atgtcccctg 
ggaccaggtg 
aacagtctca 
tagagactgg 
ctaataaact 



accctgcctt 
taccagctgg 
gctgacacgc 
gctcacattc 
ccttccagtc 
tggcaccatg 
gagagtcagc 
cgtgactgcg 
gcttagaaca 
ggaatggttt 
gcttcgtgcc 
ggtgtccagc 
accatttaca 
gtctgctgtg 
agctgaagcc 
tatgcgtgac 
aaccattggg 
gaccacctgg 
cacccgggca 
tgcactggat 
tgcatctgct 
cactcagatc 
cagcagggga 
taaagagggc 
tgagcacagc 
gctggagcct 
atttccttta 
agcaatgaga 
ttttggatac 
ttccaagaat 
ctgtcttgtc 
gaggtggtgg 
cttgaaccct 
tctctaaaaa 
ttactcatat 
gggaggccgg 
cacctactca 



ggtgtgagag 
ctggggccca 
aaagggattg 
agtcccacaa 
aggtcagtga 
gactcagaaa 
ccagtgcatc 
gctgaattgc 
tatgcagcta 
ttgaaagagt 
cttgcaaatg 
tccactggcg 
gcagggacga 
actgggatca 
agcaggctga 
atcacaccca 
agtgaaatcc 
cgaatctcag 
gtgagcagag 
gtggtcaacc 
gaggagctga 
tatcagcgtc 
ggtgagccat 
aaggtaaagt 
aggggagggg 
ggaataaggg 
gactaacgaa 
aggccaggaa 
agcagaagaa 
gaagtctttc 
ctcctggaag 
ttgtagtgtg 
gaaggccagg 
cccaaacatc 
acagcaggca 
atctctagag 
ccaa 



tgagggcaga 
ggagattaaa 
cagagatcca 
tttgctttct 
ctggagagct 
agaaacgctt 
tgcaaatcct 
ccagggatga 
ttgaggacga 
ttccccaagt 
gtattgaaga 
ctgcctctgg 
gtctggccct 
ccaccagcat 
ctgcaaccag 
acttactttc 
gtgccatcag 
ctggaagtgg 
gagcccggat 
ttgtatacga 
ggcggcaggc 
tgaatccatg 
acacaggcca 
ttatggagct 
ttaatgcaga 
aggagagggg 
tatattgggg 
aagaaagagc 
acagcggctc 
cctggtgatg 
tgtatctcag 
atggatcccc 
taatgagcca 
ctggagagta 
aagagacaga 
ccatcctgct 



APOL3 cDNA SEQUENCE 3 (SEP ID NO: 4) 

NM_030644 Homo sapiens apolipoprotein L, 3 (APOL3), transcript variant alpha/b, mRNA 



l 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 



agcaggaggg 
gctgggaagc 
ctttcccttt 
cagatgcacg 
ctttcaagag 
gggagctgga 
aaacaccggg 
agctccatgg 
gctttactga 
tcctgctgac 
atgaggcaga 
acgaatatgt 
aagtcaagag 
aagaggtcca 
ctggcatcat 
cccttactgc 
gcatcgtgga 
ccagcattga 
tttcccttct 
tcaggcaagc 



tgggaccaag 
atcctgtttt 
tgggttccag 
gctggaggtg 
aagcttcctt 
gcaagtagaa 
ctagggttaa 
aaagtctctc 
agaggccacc 
taacaatgaa 
tgctctctac 
gcagcagaaa 
gaagatccag 
cagaggctgc 
gtcccttgct 
agctggggta 
gcactcatac 
ccgattgaag 
taataattat 
cagagccagg 



ggtgctgctg 
gcatgtttga 
ggtatatctc 
ggatccacac 
ggacaaaagg 
tttctctaaa 
gaaaaaaaac 
agtgacctgg 
aaatacttcc 
gcctggaaga 
gaagctctga 
gatgagcagt 
gagtccatag 
accatctcca 
ggtcttgttt 
gggctgggag 
acatcatcag 
gtatttaagg 
tacgaagcca 
gcccgactcc 



gaccaaggat 
tcaggagctg 
agagcctgga 
agctcagaac 
accctgcctt 
taccagctgg 
gaacccttcc 
ctgctggcac 
gggagagagt 
gattcgtgac 
agaagcttag 
ttagggaatg 
aaaagcttcg 
atgtggtgtc 
tggcaccatt 
cagcgtctgc 
cagaagctga 
aagttatgcg 
cacaaaccat 
ctgtgaccac 



gggactgggc 
ctgccaagtt 
gaacgtgtct 
agctggatct 
ggtgtgagag 
ctggggccca 
agtcaggtca 
catggactca 
cagcccagtg 
tgcggctgaa 
aacatatgca 
gtttttgaaa 
tgcccttgca 
cagctccact 
tacagcaggg 
tgtgactggg 
agccagcagg 
tgacatcaca 
tgggagtgaa 
ctggcgaatc 



caagggtggg 
gtgactttca 
ggttattatg 
tgctcacact 
tgagggcaga 
ggagattaaa 
gtgactggag 
gaaaagaaac 
catctgcaaa 
ttgcccaggg 
gctattgagg 
gagtttcccc 
aatggtattg 
ggcgctgcct 
acgagtctgg 
atcaccacca 
ctgactgcaa 
cccaacttac 
atccgtgcca 
tcagctggaa 
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1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 
2161 
2221 
2281 



gtggtggtca 
ggatcctgag 
acgagtcaaa 
aggctcagga 
catgccatac 
gccacgacaa 
agctgagtgt 
cagatggcaa 
ggggactgga 

ggggggagga 
gagctgaaaa 
gctccactac 
gatggtcccc 
tcagtcagcc 
cccctttagg 
gccatggcca 
agtatgcgag 
cagaaaatta 
tgctgagtgc 



agcagagaga 
tgcgaccact 
gcacttgcat 
gctggaggag 
ccactgaccc 
aatgcaggca 
tagtgacttt 
gtgcaccaag 
gagtgtgggg 
atagagggga 
tgcagaaagc 
cgacctgccc 
tgccctgtct 
agtggcttct 
ttatttaggg 
ttgtccccag 
aacctaccaa 
actgaaaagc 
cctgtgtgta 



acgattgcag 
tcaggcatct 
gagggggcaa 
aatctaatgg 
cagaccagtg 
ttttattagg 
ggcatttctg 
gagaaggcag 
aataggaaga 
ggtgtgcagg 
cgaagagtta 
ccggttcgat 
ttccagcatc 
tgatgatggc 
gtatatgtcc 
ctgaggacca 
gaaaaacagt 
agtttagaga 
agtcctaata 



gcaccacccg 
tccttgcact 
agtctgcatc 
agctcactca 
cagccagcag 
gggataaaga 
tagctgagca 
gaatgctgga 
agaaatttcc 
aaccagcaat 
gaacttttgg 
gtccttccaa 
cactctgtct 
ggtggaggtg 
cctgcttgaa 
ggtgtctcta 
ctcattactc 
ctgggggagg 
aactcaccta 



ggcagtgagc 
ggatgtggtc 
tgctgaggag 
gatctatcag 
gggaggtgag 
gggcaaggta 
cagcagggga 
gcctggaata 
tttagactaa 
gagaaggcca 
atacagcaga 
gaatgaagtc 
tgtcctcctg 
gtggttgtag 
ccctgaaggc 
aaaacccaaa 
atatacagca 
ccggatctct 
ctcaccaa 



agaggagccc 
aaccttgtat 
ctgaggcggc 
cgtctgaatc 
ccatacacag 
aagtttatgg 
ggggttaatg 
agggaggaga 
cgaatatatt 
ggaaaagaaa 
agaaacagcg 
tttccctggt 
gaagtgtatc 
tgtgatggat 
caggtaatga 
catcctggag 
ggcaaagaga 
agagccatcc 



APOL3 cDNA SEQUENCE 4 (SEP ID NO: 51 



NM 030644 Homo sapiens apolipoprotein L, 3 (APOL3), transcript variant alpha/c, mRNA 



l 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 
2161 



agcaggaggg 
gctgggaagc 
ctttcccttt 
cagatgcacg 
Ctttcaagag 
actggagagc 
aagaaacgct 

ctgcaaatcc 
cccagggatg 
attgaggacg 
tttccccaag 
ggtattgaag 
gctgcctctg 
agtctggccc 
accaccagca 
actgcaacca 
aacttacttt 
cgtgccatca 
gctggaagtg 
ggagcccgga 
cttgtatacg 
aggcggcagg 
ctgaatccat 
tacacaggcc 
tttatggagc 
gttaatgcag 
gaggagaggg 
atatattggg 
aaagaaagag 
aacagcggct 
ccctggtgat 
gtgtatctca 
gatggatccc 
gtaatgagcc 
cctggagagt 
aaagagacag 
gccatcctgc 



tgggaccaag 
atcctgtttt 
tgggttccag 
gctggaggtg 
aagcttcctt 
tccatggaaa 
ttactgaaga 
tgctgactaa 
aggcagatgc 
aatatgtgca 
tcaagaggaa 
aggtccacag 
gcatcatgtc 
ttactgcagc 
tcgtggagca 
gcattgaccg 
cccttcttaa 
ggcaagccag 
gtggtcaagc 
tcctgagtgc 
agtcaaagca 
ctcaggagct 
gccataccca 
acgacaaaat 
tgagtgttag 
atggcaagtg 
gactggagag 
gggaggaata 
ctgaaaatgc 
ccactaccga 
ggtcccctgc 
gtcagccagt 
ctttaggtta 
atggccattg 
atgcgagaac 
aaaattaact 
tgagtgccct 



ggtgctgctg 
gcatgtttga 
ggtatatctc 
ggatccacac 
gggttaagaa 
gtctctcagt 
ggccaccaaa 
caatgaagcc 
tctctacgaa 
gcagaaagat 
gatccaggag 
aggctgcacc 
ccttgctggt 
tggggtaggg 
ctcatacaca 
attgaaggta 
taattattac 
agccagggcc 
agagagaacg 
gaccacttca 
cttgcatgag 
ggaggagaat 
ctgaccccag 
gcaggcattt 
tgactttggc 
caccaaggag 
tgtggggaat 
gaggggaggt 
agaaagccga 
cctgcccccg 
cctgtctttc 
ggcttcttga 
tttaggggta 
tccccagctg 
ctaccaagaa 
gaaaagcagt 
gtgtgtaagt 



gaccaaggat 
tcaggagctg 
agagcctgga 
agctcagaac 
aaaaaacgaa 
gacctggctg 
tacttccggg 
tggaagagat 
gctctgaaga 
gagcagttta 
tccatagaaa 
atctccaatg 
cttgttttgg 
ctgggagcag 
tcatcagcag 
tttaaggaag 
gaagccacac 
cgactccctg 
attgcaggca 
ggcatcttcc 
ggggcaaagt 
ctaatggagc 
accagtgcag 
tattaggggg 
atttctgtag 
aaggcaggaa 
aggaagaaga 
gtgcaggaac 
agagttagaa 
gttcgatgtc 
cagcatccac 
tgatggcggt 
tatgtcccct 
aggaccaggt 
aaacagtctc 
ttagagactg 
cctaataaac 



gggactgggc 
ctgccaagtt 
gaacgtgtct 
agctggatct 
cccttccagt 
ctggcaccat 
agagagtcag 
tcgtgactgc 
agcttagaac 
gggaatggtt 
agcttcgtgc 
tggtgtccag 
caccatttac 
cgtctgctgt 
aagctgaagc 
ttatgcgtga 
aaaccattgg 
tgaccacctg 
ccacccgggc 
ttgcactgga 
ctgcatctgc 
tcactcagat 
ccagcagggg 
ataaagaggg 
ctgagcacag 
tgctggagcc 
aatttccttt 
cagcaatgag 
cttttggata 
cttccaagaa 
tctgtcttgt 
ggaggtggtg 
gcttgaaccc 
gtctctaaaa 
attactcata 
ggggaggccg 
tcacctactc 



caagggtggg 
gtgactttca 
ggttattatg 
tgctcacact 
caggtcagtg 
ggactcagaa 
cccagtgcat 
ggctgaattg 
atatgcagct 
tttgaaagag 
ccttgcaaat 
ctccactggc 
agcagggacg 
gactgggatc 
cagcaggctg 
catcacaccc 
gagtgaaatc 
gcgaatctca 
agtgagcaga 
tgtggtcaac 
tgaggagctg 
ctatcagcgt 
aggtgagcca 
caaggtaaag 
caggggaggg 
tggaataagg 
agactaacga 
aaggccagga 
cagcagaaga 
tgaagtcttt 
cctcctggaa 
gttgtagtgt 
tgaaggccag 
acccaaacat 
tacagcaggc 
gatctctaga 
accaa 
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APOL3 cDNA SEQUENCE 5 (SEP ID NO: 6) 

NM_1 45641 Homo sapiens apolipoprotein L, 3 (APOL3), transcript variant beta/a, mRNA 



l 
61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 
2161 
2221 
2281 
2341 
2401 
2461 
2521 
2581 
2641 
2701 
2761 
2821 
2881 
2941 
3001 
3061 
3121 
3181 
3241 
3301 
3361 
3421 
3481 



actctgggga 
cttggggtga 
ccacctgtgc 
tccatccatt 
gtttcactgt 
taactaccta 
caattaatct 
tgctctcttg 
tagatatcag 
tggattctag 
ctataagttg 
ccagttgtta 
ggccctgtct 
gcaatgttta 
tcatcaattt 
ctggttagtt 
aatggagtta 
tcagtgctgt 
tgctatcgat 
agcagacaag 
aaatatcaac 
cctcagaagt 
gaatgaaaga 
tgtgaaatta 
aaccagaagt 
ccagaatagc 
aacacaaaga 
acataaggaa 
gttattatgc 
gctcacactc 
aatacttccg 
cctggaagag 
aagctctgaa 
atgagcagtt 
agtccataga 
ccatctccaa 
gtcttgtttt 
ggctgggagc 
catcatcagc 
tatttaagga 
acgaagccac 
cccgactccc 
cgattgcagg 
caggcatctt 
agggggcaaa 
atctaatgga 
agaccagtgc 
tttattaggg 
gcatttctgt 
agaaggcagg 
ataggaagaa 
gtgtgcagga 
gaagagttag 
cggttcgatg 
tccagcatcc 
gatgatggcg 
tatatgtccc 
tgaggaccag 
aaaaacagtc 



aaggagggta 
cctgagggca 
cttactttct 
aggatggcat 
cagcctttct 
atatttcccc 
ggctacttcc 
tagctggtag 
acatggttga 
gtgggaagaa 
tataaagctg 
aatttccacc 
gtaatttttt 
tcaagtgttg 
cataagacta 
agattaataa 
aggaggtcct 
taaaactcct 
aagagtgagt 
cagagatatg 
agtgatacta 
ggaagtcaag 
ggaatcaaaa 
tggggagctg 
tgttatgttt 
tggcttcttt 
agaattagaa 
gtgtccagaa 
agatgcacgg 
tttcaagaga 
ggagagagtc 
attcgtgact 
gaagcttaga 
tagggaatgg 
aaagcttcgt 
tgtggtgtcc 
ggcaccattt 
agcgtctgct 
agaagctgaa 
agttatgcgt 
acaaaccatt 
tgtgaccacc 
caccacccgg 
ccttgcactg 
gtctgcatct 
gctcactcag 
agccagcagg 
ggataaagag 
agctgagcac 
aatgctggag 
gaaatttcct 
accagcaatg 
aacttttgga 
tccttccaag 
actctgtctt 
gtggaggtgg 
ctgcttgaac 
gtgtctctaa 
tcattactca 



caaccacatg taaattttat 
tgtttttgtg ttactgggca 
ctttctttgg gatgtgctgg 
taggacggtt ctggggctaa 
actctctttt cttacccact 
ctttagagaa aaaaagccaa 
tgctgacaag aggcagtggt 
gttgggcaga gaaaagtggt 
gacctcgcgg taaccttgtg 
acaaaacaac cttgtaaatc 
ttttaggacc aagaaagggg 
agtcaaagcc tcctgaaact 
taagttggtt tgcactttac 
cacaagctcc cccttgattg 
tgcttgctaa tgaagcaatt 
tttgttgagt aatttttaag 
ccagttcccg tgccaatacc 
ctttcagttt taccagattg 
ttggggatga tgtaaactag 
ccttgtttct atacaaaaat 
aagtagggtt tttccatgct 
gctagttcat gtaatggggg 
gcccaattag aggaaggagt 
cagttttttg caatgatttt 
aaaatgatat tctggtgtgg 
ccacatagtg gttctgttgg 
tatcaggtga aggtagcagg 
gccacacagg gtatatctca 
ctggaggtgg gatccacaca 
agcttccttg aaaagaaacg 
agcccagtgc atctgcaaat 
gcggctgaat tgcccaggga 
acatatgcag ctattgagga 
tttttgaaag agtttcccca 
gcccttgcaa atggtattga 
agctccactg gcgctgcctc 
acagcaggga cgagtctggc 
gtgactggga tcaccaccag 
gccagcaggc tgactgcaac 
gacatcacac ccaacttact 
gggagtgaaa tccgtgccat 
tggcgaatct cagctggaag 
gcagtgagca gaggagcccg 
gatgtggtca accttgtata 
gctgaggagc tgaggcggca 
atctatcagc gtctgaatcc 
ggaggtgagc catacacagg 
ggcaaggtaa agtttatgga 
agcaggggag gggttaatgc 
cctggaataa gggaggagag 
ttagactaac gaatatattg 
agaaggccag gaaaagaaag 
tacagcagaa gaaacagcgg 
aatgaagtct ttccctggtg 
gtcctcctgg aagtgtatct 
tggttgtagt gtgatggatc 
cctgaaggcc aggtaatgag 
aaacccaaac atcctggaga 
tatacagcag gcaaagagac 
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tataatgatg 
tgtgcccctt 
ccacagactt 
acttaggtgg 
cctagctgct 
atttttgggt 
aataattggg 
ggccatccaa 
tagaatcatt 
aaatgagcat 
gctaaccagg 
ctgttttcca 
ctgattggtt 
gctgtgagca 
tgttctgaaa 
aagagtttct 
agccagaacg 
agcgtgtata 
ggccaaatcc 
gtgatttgtc 
gtgtaagtct 
aacacaggca 
gatgggatcc 
gcccgaagtt 
gtctggggaa 
ggggcttggg 
tgctcctggc 
gagcctggag 
gctcagaaca 
ctttactgaa 
cctgctgact 
tgaggcagat 
cgaatatgtg 
agtcaagagg 
agaggtccac 
tggcatcatg 
ccttactgca 
catcgtggag 
cagcattgac 
ttcccttctt 
caggcaagcc 
tggtggtcaa 
gatcctgagt 
cgagtcaaag 
ggctcaggag 
atgccatacc 
ccacgacaaa 
gctgagtgtt 
agatggcaag 
gggactggag 
gggggaggaa 
agctgaaaat 
ctccactacc 
atggtcccct 
cagtcagcca 
ccctttaggt 
ccatggccat 
gtatgcgaga 
agaaaattaa 



gtataatgaa 
taggagactt 
taccaaaaac 
gccaggggct 
aatgtctatt 
agatcggtga 
ttctcttttt 
ggggcccacg 
tggagtttta 
cgtttgaaag 
aaaaccagga 
ttaacttgtt 
gatgaaaaca 
aattaaaagc 
gggtattgac 
gggcagtaaa 
ccaattatag 
gggaggggaa 
ccgtttaatt 
atgttaagac 
gctttaatac 
gtacttgaca 
caaccaatat 
gtgagactga 
aagggtagat 
aaatgaataa 
agaagaaccc 
aacgtgtctg 
gctggatctt 
gaggccacca 
aacaatgaag 
gctctctacg 
cagcagaaag 
aagatccagg 
agaggctgca 
tcccttgctg 
gctggggtag 
cactcataca 
cgattgaagg 
aataattatt 
agagccaggg 
gcagagagaa 
gcgaccactt 
cacttgcatg 
ctggaggaga 
cactgacccc 
atgcaggcat 
agtgactttg 
tgcaccaagg 
agtgtgggga 
tagaggggag 
gcagaaagcc 
gacctgcccc 
gccctgtctt 
gtggcttctt 
tatttagggg 
tgtccccagc 
acctaccaag 
ctgaaaagca 
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3541 gtttagagac tgggggaggc cggatctcta gagccatcct gctgagtgcc ctgtgtgtaa 
3601 gtcctaataa actcacctac tcaccaa 

APOL3 cDNA SEQUENCE 6 (SEP ID NO: 7) 

NM 145642 Homo sapiens apolipoprotein L, 3 (APOL3), transcript variant beta/a, mRNA 



1 

61 
121 
181 
241 
301 
361 
421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 
2161 
2221 
2281 
2341 
2401 
2461 
2521 
2581 
2641 
2701 
2761 
2821 
2881 
2941 
3001 
3061 
3121 
3181 
3241 



actctgggga 
cttggggtga 
ccacctgtgc 
tccatccatt 
gtttcactgt 
taactaccta 
caattaatct 
tgctctcttg 
tagatatcag 
tggattctag 
ctataagttg 
ccagttgtta 
ggccctgtct 
gcaatgttta 
tcatcaattt 
ctggttagtt 
aatggagtta 
tcagtgctgt 
tgctatcgat 
agcagacaag 
aaatatcaac 
cctcagaagt 
gaatgaaaga 
tgtgaaatta 
aaccagaagt 
ccagaatagc 
aacacaaaga 
attatgcaga 
cacactcttt 
acttccggga 
ggaagagatt 
ctctgaagaa 
agcagtttag 
ccatagaaaa 
tctccaatgt 
ttgttttggc 
tgggagcagc 
catcagcaga 
ttaaggaagt 
aagccacaca 
gactccctgt 
ttgcaggcac 
gcatcttcct 
gggcaaagtc 
taatggagct 
ccagtgcagc 
attaggggga 
tttctgtagc 
aggcaggaat 
ggaagaagaa 
tgcaggaacc 
gagttagaac 
ttcgatgtcc 
agcatccact 
gatggcggtg 



aaggagggta 
cctgagggca 
cttactttct 
aggatggcat 
cagcctttct 
atatttcccc 
ggctacttcc 
tagctggtag 
acatggttga 
gtgggaagaa 
tataaagctg 
aatttccacc 
gtaatttttt 
tcaagtgttg 
cataagacta 
agattaataa 
aggaggtcct 
taaaactcct 
aagagtgagt 
cagagatatg 
agtgatacta 
ggaagtcaag 
ggaatcaaaa 
tggggagctg 
tgttatgttt 
tggcttcttt 
agaattagaa 
tgcacggctg 
caagagaagc 
gagagtcagc 
cgtgactgcg 
gcttagaaca 
ggaatggttt 
gcttcgtgcc 
ggtgtccagc 
accatttaca 
gtctgctgtg 
agctgaagcc 
tatgcgtgac 
aaccattggg 
gaccacctgg 
cacccgggca 
tgcactggat 
tgcatctgct 
cactcagatc 
cagcagggga 
taaagagggc 
tgagcacagc 
gctggagcct 
atttccttta 
agcaatgaga 
ttttggatac 
ttccaagaat 
ctgtcttgtc 
gaggtggtgg 



caaccacatg 
tgtttttgtg 
ctttctttgg 
taggacggtt 
actctctttt 
ctttagagaa 
tgctgacaag 
gttgggcaga 
gacctcgcgg 
acaaaacaac 
ttttaggacc 
agtcaaagcc 
taagttggtt 
cacaagctcc 
tgcttgctaa 
tttgttgagt 
ccagttcccg 
ctttcagttt 
ttggggatga 
ccttgtttct 
aagtagggtt 
gctagttcat 
gcccaattag 
cagttttttg 
aaaatgatat 
ccacatagtg 
tatcagggta 
gaggtgggat 
ttccttgaaa 
ccagtgcatc 
gctgaattgc 
tatgcagcta 
ttgaaagagt 
cttgcaaatg 
tccactggcg 
gcagggacga 
actgggatca 
agcaggctga 
atcacaccca 
agtgaaatcc 
cgaatctcag 
gtgagcagag 
gtggtcaacc 
gaggagctga 
tatcagcgtc 
ggtgagccat 
aaggtaaagt 
aggggagggg 
ggaataaggg 
gactaacgaa 
aggccaggaa 
agcagaagaa 
gaagtctttc 
ctcctggaag 
ttgtagtgtg 



taaattttat 
ttactgggca 
gatgtgctgg 
ctggggctaa 
cttacccact 
aaaaagccaa 
aggcagtggt 
gaaaagtggt 
taaccttgtg 
cttgtaaatc 
aagaaagggg 
tcctgaaact 
tgcactttac 
cccttgattg 
tgaagcaatt 
aatttttaag 
tgccaatacc 
taccagattg 
tgtaaactag 
atacaaaaat 
tttccatgct 
gtaatggggg 
aggaaggagt 
caatgatttt 
tctggtgtgg 
gttctgttgg 
tatctcagag 
ccacacagct 
agaaacgctt 
tgcaaatcct 
ccagggatga 
ttgaggacga 
ttccccaagt 
gtattgaaga 
ctgcctctgg 
gtctggccct 
ccaccagcat 
ctgcaaccag 
acttactttc 
gtgccatcag 
ctggaagtgg 
gagcccggat 
ttgtatacga 
ggcggcaggc 
tgaatccatg 
acacaggcca 
ttatggagct 
ttaatgcaga 
aggagagggg 
tatattgggg 
aagaaagagc 
acagcggctc 
cctggtgatg 
tgtatctcag 
atggatcccc 



tataatgatg 
tgtgcccctt 
ccacagactt 
acttaggtgg 
cctagctgct 
atttttgggt 
aataattggg 
ggccatccaa 
tagaatcatt 
aaatgagcat 
gctaaccagg 
ctgttttcca 
ctgattggtt 
gctgtgagca 
tgttctgaaa 
aagagtttct 
agccagaacg 
agcgtgtata 
ggccaaatcc 
gtgatttgtc 
gtgtaagtct 
aacacaggca 
gatgggatcc 
gcccgaagtt 
gtctggggaa 
ggggcttggg 
cctggagaac 
cagaacagct 
tactgaagag 
gctgactaac 
ggcagatgct 
atatgtgcag 
caagaggaag 
ggtccacaga 
catcatgtcc 
tactgcagct 
cgtggagcac 
cattgaccga 
ccttcttaat 
gcaagccaga 
tggtcaagca 
cctgagtgcg 
gtcaaagcac 
tcaggagctg 
ccatacccac 
cgacaaaatg 
gagtgttagt 
tggcaagtgc 
actggagagt 
ggaggaatag 
tgaaaatgca 
cactaccgac 
gtcccctgcc 
tcagccagtg 
tttaggttat 



gtataatgaa 
taggagactt 
taccaaaaac 
gccaggggct 
aatgtctatt 
agatcggtga 
ttctcttttt 
ggggcccacg 
tggagtttta 
cgtttgaaag 
aaaaccagga 
ttaacttgtt 
gatgaaaaca 
aattaaaagc 
gggtattgac 
gggcagtaaa 
ccaattatag 
gggaggggaa 
ccgtttaatt 
atgttaagac 
gctttaatac 
gtacttgaca 
caaccaatat 
gtgagactga 
aagggtagat 
aaatgaataa 
gtgtctggtt 
ggatcttgct 
gccaccaaat 
aatgaagcct 
ctctacgaag 
cagaaagatg 
atccaggagt 
ggctgcacca 
cttgctggtc 
ggggtagggc 
tcatacacat 
ttgaaggtat 
aattattacg 
gccagggccc 
gagagaacga 
accacttcag 
ttgcatgagg 
gaggagaatc 
tgaccccaga 
caggcatttt 
gactttggca 
accaaggaga 
gtggggaata 
aggggaggtg 
gaaagccgaa 
ctgcccccgg 
ctgtctttcc 
gcttcttgat 
ttaggggtat 
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3301 atgtcccctg cttgaaccct gaaggccagg taatgagcca tggccattgt ccccagctga 

3361 ggaccaggtg tctctaaaaa cccaaacatc ctggagagta tgcgagaacc taccaagaaa 

3421 aacagtctca ttactcatat acagcaggca aagagacaga aaattaactg aaaagcagtt 

34 81 tagagactgg gggaggccgg atctctagag ccatcctgct gagtgccctg tgtgtaagtc 

3541 ctaataaact cacctactca ccaa 



[0243] The following are human polypeptide sequences for isoform 1, isoform 2 and isoform 3 of 
APOL3. 



APOL3 AMINO ACID SEQUENCE 1 (SEP ID NO: 8) 

NP 663615 Homo sapiens apolipoprotein L, 3 (APOL3), isoform 1, protein 

MGLGQGWGWEASCFACLIRSCCQVVTFTFPFGFQGISQSLENVSGYYADARLEVGSTQ 

LRTAGSCSHSFKRSFLEKKRFTEEATKYFRERVSPVHLQILLTNNEAWKRFVTAAELPR 

DEADALYEALKKLRTYAAIEDEYVQQKDEQFREWFLKEFPQVKRKIQESIEKLRALAN 

GIEEVHRGCTISNVVSSSTGAASGIMSLAGLVLAPFTAGTSLALTAAGVGLGAASAVTGI 

TTSrVEHSYTSSAEAEASRLTATSIDRLKVFKEVMRDITPNLLSLLNNYYEATQTIGSEIR 

AIRQARARARLPVTTWRISAGSGGQAERTIAGTTRAVSRGARILSATTSGIFLALDVVNL 

VYESKHLHEGAKSASAEELRRQAQELEENLMELTQIYQRLNPCHTH 



APOL3 AMINO ACID SEQUENCE 2 (SEP ID NO: 9) 

NP_055164 Homo sapiens apolipoprotein L, 3 (APPL3), isoform 2, protein 

MDSEKKRFTEEATKYFRERVSPVHLQILLTNlvIEAWKRFVTAAELPRDEADALYEALKK 

LRTYAAIEDEYVQQKDEQFREWFLKEFPQVKRKIQESIEKLRALANGIEEVHRGCTISNV 

VSSSTGAASGIMSLAGLVLAPFTAGTSLALTAAGVGLGAASAVTGITTSIVEHSYTSSAE 

AEASRLTATSIDRLKVFKEVMRDITPNLLSLLNNYYEATQTIGSEIRAIRQARARARLPVT 

TWRISAGSGGQAERTIAGTTRAVSRGARILSATTSGIFLALDVVNLVYESKHLHEGAKS 

ASAEELRRQAQELEENLMELTQIYQRLNPCHTH 

APOL3 AMINO ACID SEQUENCE 3 (SEP ID NO: 10) 

NP_663616 Homo sapiens apolipoprotein L, 3 (APPL3), isoform 3, protein 

MSLAGLVLAPFTAGTSLALTAAGVGLGAASAVTGITTSIVEHSYTSSAEAEASRLTATSI 
DRLKVFKEVMRDITPNLLSLLNNYYEATQTIGSEIRAIRQARARARLPVTTWRISAGSGG 
QAERTIAGTTRAVSRGARILSATTSGIFLALDVVNLVYESKHLHEGAKSASAEELRRQA 
QELEENLMELTQIYQRLNPCHTH 

[0244] Modifications may be made to the foregoing without departing from the basic aspects of the 
invention. Although the invention has been described in substantial detail with reference to one or more 
specific embodiments, those of skill in the art will recognize that changes may be made to the 
embodiments specifically disclosed in this application, yet these modifications and improvements are 
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within the scope and spirit of the invention, as set forth in the aspects which follow. All publications or 
patent documents cited in this specification are incorporated herein by reference as if each such 
publication or document was specifically and individually indicated to be incorporated herein by 
reference. 

[0245] Citation of the above publications or documents is not intended as an admission that any of 
the foregoing is pertinent prior art, nor does it constitute any admission as to the contents or date of these 
publications or documents. U.S. patents and other publications referenced herein are hereby incorporated 
by reference. 
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What is claimed is : 

1 . A method for identifying a subject at risk of osteoarthritis, which comprises detecting the 
presence or absence of one or more polymorphic variations associated with osteoarthritis in a nucleic acid 
sample from a subject, wherein the one or more polymorphic variations are detected in a nucleotide 
sequence selected from the group consisting of: 

(a) a nucleotide sequence in SEQ ID NO: 1-7; 

(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQ ID NO: 1-7; 

(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1 -7; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c); 

whereby the presence of the polymorphic variation is indicative of the subject being at risk of 
osteoarthritis. 

2. The method of claim 1 , which further comprises obtaining the nucleic acid sample from the 

subject. 

3. The method of claim 1 , wherein the one or more polymorphic variations are detected within 
a region spanning chromosome positions 34828750 and 34833750 in human genomic DNA. 

4. The method of claim 1, wherein the one or more polymorphic variations are detected at one 
or more positions selected from the group consisting of 201, 425, 1095, 2201, 7879, 8395, 8461, 9503, 
10304, 10695, 16300, 16444, 17591, 17988, 19116, 19358, 20300, 20669, 20891, 21451, 21978, 22785, 
24248, 24770, 24844, 25066, 25096, 25309, 25344, 25529, 25537, 25554, 27963, 28134, 28356, 29648, 
29986, 30217, 30267, 30315, 30585, 30724, 30897, 30931, 31080, 31246, 31373, 31463, 31467, 32188, 
32288, 32520, 32594, 32657, 32677, 32764, 32784, 32830, 32872, 33121, 33348, 33952, 34184, 34361, 
35026, 35192, 35600, 36033, 36289, 38869, 39629, 40530, 41621, 42379, 42802, 42865, 43644, 45051, 
45828, 45829, 46257, 47286, 47427, 47963, 48013, 48229, 48282, 48376, 48404, 49900, 52699, 52897, 
53414, 53487, 541 12, 55492, 59766, 60307, 60701, 60952, 61401, 62379, 62870, 62879, 63499, 64284, 
64408, 64760, 65230, 66127, , 6634, 66686, 66694, 67113, 67257, 67403, 67609, 68418, 68610, 69629, 
70024, 70848, 71428, 71553, 71633, 71768 ,71769, 73039, 73325, 73412, 73547, 73769, 73806, 74467, 
74472, 74473, 74482, 74494, 74592, 74670, 74672, 74714, 74723, 74749, 74861, 74892, 74893, 75176, 
75705, 75989, 76027, 77949, 77974, 78167, 78310, 78415, 78575, 78590, 78709, 78875, 79864, 81316, 
81320, 81409, 81737, 81843, 82102, 82833, 83461, 83624, 83660, 83701, 83708, 83782, 85707, 85717, 
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86486, 86833, 87115, 87234, 87479, 87561, 87604, 87674, 87958, 87992, 88019, 88074, 88079, 88115, 
88118, 88120, 88135, 88142, 88143, 88149, 88340, 88344, 88512, 88521, 88650, 88827, 89230, 89236, 
90754, 90984, 91 110, 92026, 92954, 93375, 93794, 94937, 95068, 96188, 97092 and 98812. 

5. The method of claim 1 , wherein the one or more polymorphic variations are detected at one 
or more positions in SEQ ID NO: 1 selected from the group consisting of 20300, 87958, 89236, 30267, 
32657, 36289, 38869, 45051, 46257, 54112, 60307, 63499, 20891, 52699 and 71768. 

6. The method of claim 1 , wherein the one or more polymorphic variations are detected at one 
or more positions in linkage disequilibrium with one or more positions in claim 3, 4 or 5. 

7. The method of claim 1 , wherein detecting the presence or absence of the one or more 
polymorphic variations comprises: 

hybridizing an oligonucleotide to the nucleic acid sample, wherein the oligonucleotide is 
complementary to a nucleotide sequence in the nucleic acid and hybridizes to a region adjacent to the 
polymorphic variation; 

extending the oligonucleotide in the presence of one or more nucleotides, yielding extension 
products; and 

detecting the presence or absence of a polymorphic variation in the extension products. 

8. The method of claim 1 , wherein the subject is a human. 

9. The method of claim 8, wherein the subject is a human female. 

10. The method of claim 8, wherein the subject is a human male. 

1 1 . A method for identifying a polymorphic variation associated with osteoarthritis proximal to 
an incident polymorphic variation associated with osteoarthritis, which comprises: 

identifying a polymorphic variation proximal to the incident polymorphic variation associated 
with osteoarthritis, wherein the polymorphic variation is detected in a nucleotide sequence selected from 
the group consisting of: 

(a) a nucleotide sequence in SEQ ID NO: 1 -7; 

(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQ ID NO: 1-7; 
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(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1-7; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c) comprising a polymorphic 

variation; 

determining the presence or absence of an association of the proximal polymorphic variant with 
osteoarthritis. 

12. The method of claim 11, wherein the incident polymorphic variation is at one or more 
positions in claim 3, 4 or 5. 

13. The method of claim 1 1, wherein the proximal polymorphic variation is within a region 
between about 5 kb 5' of the incident polymorphic variation and about 5 kb 3' of the incident 
polymorphic variation. 

14. The method of claim 1 1, which further comprises determining whether the proximal 
polymorphic variation is in linkage disequilibrium with the incident polymorphic variation. 

15. The method of claim 1 1, which further comprises identifying a second polymorphic variation 
proximal to the identified proximal polymorphic variation associated with osteoarthritis and determining 
if the second proximal polymorphic variation is associated with osteoarthritis. 

16. The method of claim 1 5, wherein the second proximal polymorphic variant is within a region 
between about 5 kb 5' of the incident polymorphic variation and about 5 kb 3' of the proximal 
polymorphic variation associated with osteoarthritis. 

17. An isolated nucleic acid comprising a nucleotide sequence selected from the group consisting 

of: 

(a) a nucleotide sequence in SEQ ID NO: 1-7; 

(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQ ID NO: 1-7; 

(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1-7; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c) comprising a polymorphic 
variation; and 

(e) a nucleotide sequence complementary to the nucleotide sequences of (a), (b), (c), or (d); 
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wherein the nucleotide sequence comprises a polymorphic variation associated with osteoarthritis 
selected from the group consisting of an adenine at position 20300, a thymine at position 46257, an 
adenine at position 89236, a guanine at position 30267, an adenine at position 32657, a cytosine at 
position 36289, a guanine at position 38869, a thymine at position 45051, a guanine at position 541 12, an 
adenine at position 60307, a thymine at position 63499, a guanine at position 20891 , a guanine at position 
52699, and a cytosine at position 71768. 

18. An oligonucleotide comprising a nucleotide sequence complementary to a portion of the 
nucleotide sequence of (a), (b), (c), or (d) in claim 17, wherein the 3' end of the oligonucleotide is 
adjacent to a polymorphic variation associated with osteoarthritis. 

1 9. A microarray comprising an isolated nucleic acid of claim 1 7 linked to a solid support. 

20. An isolated polypeptide encoded by the isolated nucleic acid sequence of claim 17. 

21. A method for identifying a candidate therapeutic for treating osteoarthritis, which comprises: 

(a) introducing a test molecule to a system which comprises a nucleic acid comprising a 
nucleotide sequence selected from the group consisting of: 

(i) a nucleotide sequence in SEQ ID NO: 1 -7; 

(ii) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide 
sequence in SEQ ID NO: 1-7; 

(iii) a nucleotide sequence which encodes a polypeptide that is 90% or more identical 
to the amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1-7; 

(iv) a fragment of a nucleotide sequence of (a), (b), or (c); or 

introducing a test molecule to a system which comprises a protein encoded by a nucleotide 
sequence of (i), (ii), (iii), or (iv); and 

(b) determining the presence or absence of an interaction between the test molecule and the 
nucleic acid or protein, 

whereby the presence of an interaction between the test molecule and the nucleic acid or protein 
identifies the test molecule as a candidate therapeutic for treating osteoarthritis. 

22. The method of claim 21 , wherein the system is an animal. 

23. The method of claim 21 , wherein the system is a cell. 
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24. The method of claim 21, wherein the nucleotide sequence comprises one or more 
polymorphic variations associated with osteoarthritis. 

25. The method of claim 24, wherein the one or more polymorphic variations associated with 
osteoarthritis are at one or more positions in claim 3, 4 or 5. 

26. A method for treating osteoarthritis in a subject, which comprises contacting one or more 
cells of a subject in need thereof with a nucleic acid, wherein the nucleic acid comprises a nucleotide 
sequence selected from the group consisting of: 

(a) a nucleotide sequence in SEQ ID NO: 1-7; 

(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQ ID NO: 1-7; 

(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1 -7; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c); and 

(e) a nucleotide sequence complementary to the nucleotide sequences of (a), (b), (c), or (d); 
whereby contacting the one or more cells of the subject with the nucleic acid treats the 

osteoarthritis in the subject. 

27. The method of claim 25, wherein the nucleic acid is RNA or PNA. 

28. The method of claim 27, wherein the nucleic acid is duplex RNA. 

29. A method for treating osteoarthritis in a subject, which comprises contacting one or more 
cells of a subject in need thereof with a protein, wherein the protein is encoded by a nucleotide sequence 
which comprises a polynucleotide sequence selected from the group consisting of: 

(a) a nucleotide sequence in SEQ ID NO: 1 -7; 

(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQ ID NO: 1-7; 

(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1-7; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c); 

whereby contacting the one or more cells of the subject with the protein treats the osteoarthritis 
in the subject. 
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30. A method for treating osteoarthritis in a subject, which comprises: 

detecting the presence or absence of one or more polymorphic variations associated with 
osteoarthritis in a nucleic acid sample from a subject, wherein the one or more polymorphic variation are 
detected in a nucleotide sequence selected from the group consisting of: 

(a) a nucleotide sequence in SEQ ID NO: 1 -7; 

(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQ ID NO: 1-7; 

(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1 -7; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c) comprising a polymorphic 
variation; and 

administering an osteoarthritis treatment to a subject in need thereof based upon the presence or 
absence of the one or more polymorphic variations in the nucleic acid sample. 

31 . The method of claim 30, wherein the one or more polymorphic variations are detected at one 
or more positions in claim 3, 4 or 5. 

32. The method of claim 30, wherein the treatment is selected from the group consisting of 
administering a corticosteroid, a nonsteroidal anti-inflammatory drug (NSATD), a cyclooxygenase-2 
(COX-2) inhibitor, an antibody, a glucocorticoid, hyaluronic acid, chondrotin sulfate, glucosamine or 
acetaminophen; prescribing a heat/cold regimen or a joint protection regimen; performing joint surgery; 
prescribing a weight control regimen; and combinations of the foregoing. 

33. A method for detecting or preventing osteoarthritis in a subject, which comprises: 
detecting the presence or absence of one or more polymorphic variations associated with 

osteoarthritis in a nucleic acid sample from a subject, wherein the polymorphic variation is detected in a 
nucleotide sequence selected from the group consisting of: 

(a) a nucleotide sequence in SEQ ID NO: 1 -7; 

(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQ ID NO: 1-7; 

(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1 -7; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c) comprising a polymorphic 
variation; and 
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administering an osteoarthritis prevention or detection procedure to a subject in need thereof 
based upon the presence or absence of the one or more polymorphic variations in the nucleic acid sample. 

34. The method of claim 33, wherein the one or more polymorphic variations are detected at one 
or more positions in claim 3, 4 or 5. 

35. The method of claim 33, wherein the osteoarthritis prevention is selected from the group 
consisting of administering a corticosteroid, a nonsteroidal anti-inflammatory drug (NSAJD), a 
cyclooxygenase-2 (COX-2) inhibitor, an antibody, a glucocorticoid, hyaluronic acid, chondrotin sulfate, 
glucosamine or acetaminophen; prescribing a heat/cold regimen or a joint protection regimen; performing 
joint surgery; prescribing a weight control regimen; and combinations of the foregoing. 

36. A method of targeting information for preventing or treating osteoarthritis to a subject in 
need thereof, which comprises: 

detecting the presence or absence of one or more polymorphic variations associated with 
osteoarthritis in a nucleic acid sample from a subject, wherein the polymorphic variation is detected in a 
nucleotide sequence selected from the group consisting of: 

(a) a nucleotide sequence in SEQ ID NO: 1-7; 

(b) a nucleotide sequence which encodes a polypeptide encoded by a nucleotide sequence in 
SEQ ID NO: 1-7; 

(c) a nucleotide sequence which encodes a polypeptide that is 90% or more identical to the 
amino acid sequence encoded by a nucleotide sequence in SEQ ID NO: 1 -7; 

(d) a fragment of a nucleotide sequence of (a), (b), or (c) comprising a polymorphic 
variation; and 

directing information for preventing or treating osteoarthritis to a subject in need thereof based 
upon the presence or absence of the one or more polymorphic variations in the nucleic acid sample. 

37. The method of claim 36, wherein the one or more polymorphic variations are detected at one 
or more positions in claim 3, 4 or 5. 

38. A composition comprising a cell from a subject having osteoarthritis or at risk of 
osteoarthritis and an antibody that specifically binds to a protein, polypeptide or peptide encoded by a 
nucleotide sequence identical to or 90% or more identical to a nucleotide sequence in SEQ ED NO: 1-7. 
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39. A composition comprising a cell from a subject having osteoarthritis or at risk of 
osteoarthritis and a RNA, DNA, PNA or ribozyme molecule comprising a nucleotide sequence identical 
to or 90% or more identical to a portion of a nucleotide sequence in SEQ ID NO: 1-7. 

40. The composition of claim 38, wherein the RNA molecule is a short inhibitory RNA 
molecule. 
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Abstract of the Disclosure 



Provided herein are methods for identifying a risk of osteoarthritis in a subject, reagents and kits 
for carrying out the methods, methods for identifying candidate therapeutics for treating osteoarthritis, 
and therapeutic and preventative methods applicable to osteoarthritis. These embodiments are based 
upon an analysis of polymorphic variations in nucleotide sequences within the human genome. 
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